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FOREWORD 


This  Interim  Technical  Documentary  Report  was  prepared  by  personnel 
of  the  Applied  Research  and  Management  Sciences  Division  of  C-E-I-R,  INC., 
including  Dr.  W.  R.  Blischke,  Mr*  A.  J.  Truelove  and  Mr.  P.  B.  Mundle  of 
the  Los  Angeles  Center  and  Dr.  11.  V.  Tohns,  Jr.  of  the  Los  Altos  Office. 

The  research  reported  herein  was  conducted  under  Contract  Number  AF  33(615)- 
3152;  more  complete  results  of  this  continuing  investigation  will  be  re¬ 
ported  in  a  Final  Technical  Documentary  Report.  This  contract  is  a  part 
of  Project  7071,  Research  in  Applied  Mathematics.  The  contract  monitor 
is  Dr.  H.  Leon  Harter,  of  the  Aerosp  ice  Research  Laboratories.  His  valu¬ 
able  suggestions  and  continued  inteest  in  and  encouragement  of  this  re¬ 
search  are  very  much  appreciated. 
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ABSTRACT 


The  prolect  is  a  continuation  of  research  on  problems  in  non¬ 
regular  estimation  reported  in  ARL  Technical  Documentary  Report  No. 
nKi.  65-177(1963).  Included  in  that  report  was  a  lower  bound  on  the 
variance  of  unbiased  estimators  of  the  location  parameter  of  the  Pearson 
Type  III  distribution,  applicable  in  the  non-regular  case.  This  report 
Includes  the  results  of  a  numerical  investigation  of  that  bound  for 
varying  values  of  the  shape  parameter  of  the  Type  III  distribution  and 
varying  sample  sizes.  The  bound  is  apparently  of  the  correct  order  of 
magnitude  in  a  certain  region  of  the  parameter  space  but  sub-optimal 
elsewhere.  Approximations  to  the  Pitman  estimators  for  location  param¬ 
eters  are  investigated  for  both  the  Pearson  Type  III  and  Ueibull  distri¬ 
butions.  In  both  cases,  the  minimum  observation  apparently  contains 
the  major  part  of  the  information  concerning  the  unknown  location 
parameter.  Some  results  on  the  non-regular  estimation  problem,  particu¬ 
larly  concerning  the  derivation  of  variance  bounds,  in  the  cases  of 
densities  with  bounded  domain  depending  on  an  unknown  parameter  and  of 
mixtures  of  uniform  distributions,  are  also  discussed. 
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1.  INTRODUCTION  AND  SUMMARY 


An  estimation  problem  in  which  the  conditions,  on  the  underlying 
probability  distributions,  given  by  eraser  [  6,  Section  33.3]  are  not 
satisfied  is  called  a  problem  in  non-regular  estimation*  It  is  from 
conditions  such  as  those  given  by  Cramer  that  follow  the  well-known 
asymptotic  properties  of  maximum  likelihood  estimators  and  of  the  large 
class  of  estimators,  known  as  BAN  estimators,  which  are  asymptotically 
equivalent  to  maximum  likelihood.  When  the  regularity  conditions  are 
not  satisfied,  it  often  happens  that  the  estimation  problem  is  not  amen¬ 
able  to  any  of  the  standard  approaches  which  might  provide  st  least  a 
straightforward  asymptotic  solution  such  as  that  provided  by  the  theory 
of  maximum  likelihood  in  the  regular  case.  In  such  situations,  problems 
of  considerable  analytical  complexity  are  encountered. 

In  a  previous  work  [2],  investigations  of  several  aspects  of  the 
problem  of  non-regular  estimation,  including  a  number  in  the  latter 
category,  were  reported.  This  report  is  concerned  with  additional  re¬ 
sults  on  non-regular  estimation.  Including  continuations  of  some  studies 
initiated  under  the  previous  project  as  well  as  some  new  studies.  As 
reflected  in  the  title  of  this  report,  the  major  part  of  the  effort  in 
this  project,  and  consequently  the  majority  of  the  results,  are  concerned 
with  estimation  of  the  location  parameter,  in  the  non-regular  case,  of 
the  Pearson  Type  III  and  Welbull  distributions. 

In  the  regular  case,  the  BAN  estimators  are  consistent,  asymptoti¬ 
cally  normally  distributed  and  asymptotically  efficient  in  th  sense 
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that  no  other  asymptotically  normal  estimator  1  an  asymptotic  distri¬ 
bution  with  smaller  variance.  Some  or  all  of  these  results  may  fail  to 
hold  in  the  non-regular  case.  This  is  true  for  a  non-trivial  subset  of 
the  prraroeter  space  for  both  the  distributions  of  interest  to  this  in¬ 
vestigation.  In  fact,  in  certain  regions  the  likelihood  function  is 
unbounded.  One  roust,  therefore,  necessarily  seek  alternative  estimators. 

In  this  search  we  use  the  property  of  minimum  variance  as  our  criterion 
of  optimality,  although  it  is  recognized  that  this  choice  is  subject  to 
criticism  in  the  absence  of  at  least  asymptotic  normality. 

In  attempting  to  construct  minimum  variance  estimators  for  location 
parameters  of  the  Weibull  and  Type  III  distributions,  it  was  discovered 
that  not  only  did  the  regularity  conditions  not  hold,  but  most  of  the  standard 
techniques  for  constructing  lower  bounds  on  the  variance  of  estimators  led 
to  trivial  results.  *.n  general,  except  for  a  few  special  cases,  for  exam¬ 
ple  cases  where  a  complete  sufficient  statistic  exists,  this  further 
complicates  the  estimation  problem.  A  substantial  part  of  our  previous 
effort  [2]  was  devoted  to  the  construction  of  new  bounds  which  would 
yield  non-trivial  results  for  the  non-regular  case  of  the  Weibull  and 
Type  III  distributions.  The  bounds  obtained  were  found  to  be  analytically 
quite  complex.  For  this  reason,  a  numerical  investigation  of  the  bounds 
was  initiated  for  the  Type  III  distribution.  This  investigation  has  been 
extended  considerably.  The  results,  to  be  discussed  in  detail  below, 
are  somewhat  mixed.  It  appears  that  the  bound  is  quite  good,  i.e. .  is 
essentially  attainable  for  a  part  of  the  parameter  space  but,  while 
non-trivia',  is  also  non-optimal  elsewhere. 

Some  additional  analytical  results  concerning  lower  bounds  on  the 
variance  for  the  Type  III  distribution  are  also  given.  These  include 
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the  derivation  of  a  generalization  of  the  bound  given  previously.  The 
generalization  is  even  more  complex  than  the  original  and  has  not  been 
investigated  numerically. 

An  investigation  of  the  estimation  problem  itself  has  also  been 
initiated.  The  approach  pursued  is  to  approximate  an  estimator  proposed 
by  Pitman  [ll].  The  Pitman  estimator  of  a  location  parameter,  although 
known  to  be  optimal  in  a  number  of  respects,  including  minimum  variance, 
is  quite  intractable  for  the  distribution  in  question.  Thus  an  analyti¬ 
cal  investigation  requires  some  form  of  approximation.  The  approxima¬ 
tions  used  appear  to  yield  quite  reasonable  results. 

A  similar  investigation  of  the  Pitman  estimation  technique  has  been 
initiated  in  the  case  of  the  Weibull  distribution.  In  the  Weibull  case, 
since  exact  moments  of  the  order  statistics  are  available,  certain 
approximations  in  the  derivation  of  the  Pitman-type  estimator,  necessary 
in  &he  Type  III  ease,  can  be  avoided.  Preliminary  results  indicate  that 

the  Weibull  case  is  quite  similar  to  the  Pearson  Type  III. 

Finally,  some  miscellaneous  additional  results  on  the  construction 

of  variance  bounds  are  discussed.  These  include  bounds  for  densities 
with  finite  domain  and  specifically  for  mixtures  of  rectangular  distribu¬ 
tions. 
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2.  VARIANCE  BOUNDS  FOR  ESTIMATORS  OF  THE  LOCATION  PARAMETER 
OF  THE  PEARSON  TYPE  III  DISTRIBUTION 


Let  Xj,*..,  Xq  be  independent  random  variables,  each  having  a 


Pearson  Type  III  distribution 


(2.1) 


£«-gfk(T) 


-(x-a)/0 


x  >  a 


■  0  otherwise, 

where  -00  <  a  <  00  and  0  <  cr,  0  <  ®,  It  is  assumed  that  the  scale  parameter, 
0,  ana  the  shape  parameter,  a,  are  known.  The  problem  is  to  estimate  the 
location  parameter,  a,  in  the  non- regular  case,  that  is,  when  <*  =  2.  In 
the  ensuing  discussion  Y^,...,  Yq  will  be  taken  to  be  order  statistics  of  a 
sample  of  size  n  from  the  Type  III  distribution. 

Before  considering  the  problem  of  constructing  estimators  for  a, 
we  shall  present  some  results,  mostly  numerical,  concerning  lower  bounds 
on  the  variance  of  such  estimators.  We  begin  with  a  brief  suumary  of 
previous  work  on  this  problem  which  was  reported  in  reference  [ 2  ] . 

2. 1  Previous  Results 

Since  in  the  non-regular  case  of  the  Pearson  Type  III  distribution 


(2.2) 


a  d  a2 


the  Cramer-Rao  bound  becomes  the  trivial  inequality  V(t)  5  0,  where  t 

is  any  unbiased  estimator  of  a.  Alternative  methods  for  obtaining  lower 

bounds  on  the  variance  must  therefore  be  investigated. 

Blischke,  et  al.  [2  ]  discussed  application  of  several  alternatives 

to  the  problem  at  hand.  The  notation  used  is  as  follows;  X X 

1  n 
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are  assumed  to  be  identically  distributed  random  variables  with  common 
density  f(x,0),  where  0  *  (0  ,...,0  )  is  an  s-dimensional  parameter  with 

JL  8 

0.  unknown.  The  function  t  ■  t(X,,...,X  )  is  an  unbiased  estimator  of  0, . 
i  in  I 

(Although  this  discussion  is  limited  to  unbiased  estimators,  the  results 
can  be  generalized  in  an  obvious  way  to  yield  lower  bounds  on  the  mean 
square  error  of  biased  estimators.  In  fact,  in  the  sequel  we  shall  not 
be  particularly  concerned  with  the  question  of  bias.  Since  the  generali¬ 
zation  is  obvious,  we  shall  avoid  unnecessary  complications  by  consider¬ 
ing  only  the  unbiased  case  at  present.)  The  density  f  is  assumed  to  be¬ 
long  to  some  family  of  densities,  ? ,  indexed  by  the  parameter  0  belonging 
to  a  set  0.  We  define 

(2.3)  H  =  {hj (01+h,02> . . . ,0g)  e  @}  , 

(2.4)  P  =  |p|  there  exists  a  function  k(0)  such  that 

k(6)fP(x,0)  e  ?}, 

and 

(2.5)  H<*P  -  |(h,p)  |kfP(x,0+-h)  e  ?  for  some  kj, 

where  h  *  (h,0,...,0)  e  E®,  i.e.,  &fh  -  (0.+h,0o , . . . ,0  ),  we  write 
y(0)  for  that  function  of  0  for  which  k(0)fP(x,0)  ■  f(x,y(6))  and  assume 
that  y(9^> • • • »0g)  "  (ei’e2’  "  * for  a11  6  6  ®-  Finally,  Uj  and  p2 
are  any  probability  measures  on  H  such  that  hdjj^(h)  <  00  and 

E  =  J  hdu  (h)  <  ®. 

H  1 

The  bounds  discussed  previously  included  those  given  by  Chapman  and 
Robbins  [4  ],  Fraser  and  Guttman  [7],  and  Kiefer  [9].  The  Chapman- 
Robbins  bound  is 
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(2.6) 


5  f2(xt,»A) 

1-1  “(Xj.Q) 


n 

n  dx 

i-l  i 


The  bound  derived  by  Fraser  and  Guttman  is 


(2.7)  V(t)  £ 


inf 

cl»* *  * »cr 

★ 

wh«H 


n  f(*<,e> 

i-l  1 


i  _ 


where  are  non-negative  and  sum  to  unity,  and  -  {h|jh  e  H 

for  J  -  1 .  rj.  Kiefer  gives  the  result 


(2.8)  V(t)  ?  su 


(Ejh  -  S2h)' 


n  fOcl,0) 


11  dx 


‘ J 


where  the  aupreraura  is  taken  over  all  measures  for  which  the 

integrals  are  d6fined. 

A  discussion  in  which  these  and  the  ensuing  bounds  are  compared  and 
applied  to  several  distributions  is  given  in  reference  L  2 ] •  For  the 
Pearson  Type  III  distribution  the  Chapman-Robbins  and  Fraser-Guttman 
bounds  yield  trivial  results  for  y  5  1/2  and,  except  for  a  limiting  form 
of  the  Fraser-Guttman  bound  when  a  *  1,  are  less  than  the  optimal  bound 
for  1/2  <df  5  2.  The  Keifer  bound,  although  proved  by  E?rankin  L  1  ]  to 
be  optimal  under  certain  conditions,  is  essentially  an  existence  theorem 
in  the  sense  that  it  does  not  provide  an  applicable  analytical  technique 
for  construction  of  a  bound. 

Two  additional  bounds  were  developed  in  reference  [2]  in  an  attempt 
to  obtain  applicable  non-trlvial  bounds  for  the  entire  range  of  a  in  the 
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sap 


Type  III  distribution.  These  are 


(2.9)  V(t)  *  sup 


and 


(2.10) 


V(t) 


inf 

Cl’“'Cr 
h,p*H*  •] 


k2"(9)  cJ[nfp(xt,»f3h)  -  nfp(xt,»f(j-i)h)] 


h 


■J 


nf^.e) 


Udx. 


where 

(2.11)  H*®P  ®  { (h,p) jkfP(x,ftf jh)  e  ?  for  some  h  and  all  J  -  0,...,rj. 

The  latter  two  inequalities  do  yield  non^trivial  inequalities  for  all  or. 
In  practice,  however,  considerable  analytical  and  numerical  difficulties 
are  encountered.  The  details  of  the  application  of  Inequality  (2.9)  only 
were  given  previously.  Application  of  inequality  (2.10)  will  be  the 
subject  of  Section  2.3  below. 

Note  that  for  the  Pearson  Type  III  distribution  H  ■  {h|0<h<00}, 

P  -  {  p  1 1  / 2<p<q (or ) }  ,  where 

(2.12)  q(<>)  -  2([— )  0  <Qf  <  1 

■  00  a  -  1, 

H*P  is  the  Cartesian  product  of  H  and  P,  and 

(2.13)  t(9)  ■  . 

ei_pr(pa-pfi) 

2 . 2  numerical  Methods  for  Investigation  of  the  Variance  Bound 

It  has  been  shown  that  application  of  inequality  (2.9)  to  the  Type 
III  distribution  yields  the  inequality 
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where 


(2.15)  g(b;a,c)  -  J  y^ty+b)"0  e'y  dy. 

0 

Because  the  above  bound  is  analytically  quite  intractable,  a  numerical 
investigation  was  initiated.  This  investigation  involves  numerical 
Integration  of  the  function  g  and  utilizes  a  modification  of  a  method 
known  as  the  "Single"  procedure  for  the  steepest-ascent  method  described 
by  Brooks  [3  ]  in  searching  for  the  supremum  on  the  right-hand  side  of 
inequality  (2.14).  Some  preliminary  results  were  given  in  reference  [2  ]. 

An  early  version  of  the  computer  program  to  calculate  the  lower  bound 
of  inequality  (2.14)  was  described  in  detail  in  [2  ].  in  Section  B3  of 
Appendix  B  of  that  report,  certain  »odif ications  to  the  program  were 
proposed  with  a  view  to  providing  greater  efficiency  of  table  generation, 
and  to  dealing  with  certain  convergence  problems  that  had  been  troublesome. 
Several  of  these  modifications  have  been  implemented.  In  addition,  sub¬ 
sequent  difficulties  encountered  in  the  investigation  have  necessitated 
further  changes  and  improvements.  The  following  additional  features  have 
been  introduced  into  the  program  described  in  [2  ]: 

Storage  of  tables  of  auxiliary  functions,  g(b,a,c).  The  most  time- 
consuming  feature  of  the  program  i6  the  numerical  integration  required  to 
evaluate  the  function  g(b;a,c)  given  in  equation  (2.15).  It  is  therefore 
desirable  to  store  values  of  this  function  as  they  are  generated,  and  to 
use  table  look-up  and  interpolation  as  much  as  possible  in  subsequent  cal¬ 
culations. 
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The  previous  version  of  the  program  was  modified  so  that  the  tabu¬ 
lated  values  are  stored  efficiently.  The  tabulated  values  corresponding 
to  different  values  of  the  parameter  a  are  maintained  in  separate  card 
decks.  Thus,  on  any  particular  run,  only  those  decks  for  the  or -values 
used  in  this  run  need  be  read  in.  This  permits  all  calculations  to  be 
carried  out  in  core.  At  the  same  time,  the  search  procedure  was  improved. 

This  feature  Increases  the  efficiency  in  two  ways.  Firstly  the 
same  grid  of  tabulated  points  can  be  used  for  all  values  of  sample 
size  n,  for  the  same  value  of  the  distribution  parameters.  Considerable 
overlap  occurs  in  the  maximum  seeking  paths.  Secondly,  if  sufficient 
convergence  has  not  been  obtained  by  a  specified  number  of  steps  of  the 
procedure  prior  to  cut-off  in  a  given  run,  the  search  can  be  continued 
from  this  point  at  the  next  run,  without  the  need  for  recomputing  values 
of  the  g- function.  Furthermore,  if  numerical  procedures  are  ever  applied 
to  the  Fraser-Gut tman- type  bounds  (to  be  discussed  in  the  next  section), 
the  tables  already  generated  for  the  present  procedure  will  cover  a  sub¬ 
stantial  fraction  of  the  numerical  integration  required. 

The  table- interpolation  device  was  found  to  be  of  greatest  use  in 
the  region  p  <  1.  In  the  region  p?  1,  exact  calculations  were  needed 
immediately.  This  is  due  to  the  fact  that  some  of  the  quantities  become 
critical  in  this  area,  and  the  interpolation,  with  interval  0  'll  for  both 
p  and  h,  is  of  little  help  in  the  search  procedure  and  can,  in  fact,  lead 
away  from  the  value  sought. 

In  cases  where  the  Interpolation  method  was  used,  at  least  one 
iteration  was  perforried  using  the  exact  method  to  conclude  the  search 
procedure.  This  doe*  not  help  appreciably  in  determining  the  (p,n)  values, 
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but  does  provide  an  exact  value  of  the  variance  bound  at  a  point  very 
close  to  the  true  maximum. 

It  should  be  noted  that  the  exact  method  is  very  much  more  time- 
consuming  than  the  table- Interpolation  method  (by  a  factor  of  10  or 
greater). 

Reduction  of  step-slxe.  The  original  program  Involved  a  maximum- 
seeking  method  for  the  (p,h)-comblnat ion  at  which  the  maximum  value  of 
the  bound  occurs.  A  2x2  design  is  used  at  an  arbitrary  starting  point, 
and  the  gradient  of  the  surface  is  estimated.  A  step  of  predetermined 
length  is  made  in  this  direction,  and  the  step  is  repeated  as  long  as 
improvement  in  the  bound  occurs.  When  no  improvement  occurs,  a  new  2x2 
design  is  used.  The  question  of  when  the  step  size  should  be  reduced 
was  treated  as  follows:  If  the  gradient  previously  used  was  less  than 
a  predetermined  constant,  0.1  say,  the  new  step  size  is  set  at  0.6  of 
the  old.  If  not,  then  we  continue  with  the  old  step  size. 

This  procedure  has  been  Improved  in  two  ways;  first,  the  new 
gradient  is  used  to  determine  whether  to  reduce  step  size,  and  the  pre¬ 
determined  constant  is  now  an  input  variable  and  hence  can  be  made  de¬ 
pendent  on  the  sample  size  n.  1:  hn  been  conjectured  L  2  J  that  for 

larger  values  of  n,  the  value  of  the  bound  can  be  expected  to  be  of  the 
-2/a 

order  n  .  It  therefore  seemed  reasonable  that  the  value  of  the 
gradient  at  which  we  start  to  reduce  step-size  be  made  proportional 
tot  his . 

The  second  Improvement  involving  a  reduction  in  step  size  concerns 
the  possibility  of  the  sequence  of  steps  crossing  itself  or  going  round 
in  a  circle.  This  was  observed  to  happen  in  early  runs.  It  is  reason¬ 
able,  under  these  circumstances,  to  suppose  thst  we  are  near  the 
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maxinun,  and  that  step  size  should  be  reduced,  regardless  cf  the  current 
value  of  the  gradient.  To  implement  this,  we  examine  ..he  six  previous 
trial  points  at  every  stage.  If  our  current  position  is  within  the 
current  step  size  of  any  of  these,  we  reduce  step  size  by  the  factor  0.6. 

Boundary  constraints.  The  h  parameter  oust  not  be  permitted  to 
become  negative,  since  the  result  of  inequality  (2.14)  is  then  no  longer 
valid.  If  the  next  step  of  the  search  procedure  would  ma^  h  negative, 
we  refrain  from  taking  this  step,  and  instead  perform  the  2x2  design 
segment  of  the  procedure,  centered  on  our  current  position,  after  re¬ 
ducing  our  basic  interval  by  the  factor  0.6.  Furthermore,  if  at  any  time 
the  2x2  design  overlaps  the  h-axis,  we  reduce  the  interval  similarly, 
and  repeat  the  operation.  A  similar  procedure  is  followed  when  the  p 
parameter  nears  its  boundaries,  namely  p  >  0.5,  and  p  <  [2(l-ar)]  *  in 
the  event  that  or  <  1. 

One  further  case  in  which  interval  size  is  reduced  should  be  noted. 
This  occurs  when  the  values  of  the  bound  calculated  at  the  corners  of 
the  2x2  design  all  fall  below  the  value  at  the  center.  This  means  that 
a  maximum  (or  at  least  a  local  maximum)  occurs  within  the  design  square, 
so  we  reduce  interval  size  and  repeat  the  procedure. 

The  case  of  p  ■  1.  The  numerical  method  used  to  obtain  the  bound  is 
also  applicable,  of  course,  when  p  is  set  equal  to  1  in  the  event  that 
a  >  1/2,  L.e.,  when  the  Chapman-Rcbbins  bound  is  applicable.  The  re¬ 
sulting  bound,  In  general,  will  not  be  optimal,  but  it  is  intereating 
to  see  what  effect  this  modification  has. 

Since  p  is  fixed,  we  c an  no  longer  perform  Interpolation  In  the  (p,h) 
plane.  It  would  be  possible  to  perform  interpolation  in  one  dimension, 

namely,  aloog  the  h-axla,  but  It  waa  decided  that,  in  view  of  the  limited 
amount  of  computation  proposed  for  this  specisl  case,  it  would  not  be 
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worth  the  trouble  of  writ  Inc  special  routines  for  this  purpose.  Thus 
the  function  g(b;a,c)  was  calculated  directly  in  each  step.  The 
search  procedure  was  carried  out  using  the  routine  "LARMAX"  (Linear 
Maximization).  We  start  with  a  suitable  value  of  h,  and  a  suitable 
step-size,  Ah,  say.  The  variance  bound  is  calculated  for  three  values 
of  the  h-varlable,  viz.,  h  -  Ah,  h,  and  h  +  Ah,  and  for  two  intermediate 
values,  h  -  (l/2)Ah  and  h  +  (l/2)Ah.  This  pattern  of  five  points  is 
preserved  throughout  the  search  procedure.  The  range  is  then  extended 
in  either  direction,  and/or  the  step-size  is  reduced,  in  such  a  way  that 
the  maximum  is  determined  to  any  required  degree  of  accuracy.  This  will 
lead  to  the  maximum  value  of  the  bound,  assuming  that  it  is  unique. 

Overflow  precautions.  In  computing  the  expression  on  the  right-hand 
side  of  the  Inequality  (2.14),  care  must  be  taken  that  none  of  the  quanti¬ 
ties  exceed  the  floating-point  capability  of  the  computer  (approximately 
38 

10  ).  The  following  feature  provides  for  this.  The  critical  quantities 

are  T  (pof-j>fl)  In  the  numerator,  and  rn(a),  and  the  linear  combination 
•i  n 

of  g‘  of  two  arguments  and  T  (2pQ'-2p-or+2) ,  in  the  denominator.  These 
five  quantities  are  calculated  by  successive  multiplications,  2n  or  n 
times,  as  appropriate. 

When  any  of  these  three  factors  (or  in  the  case  of  the  linear 
combination,  any  of  its  three  components)  exceeds  10^^  during  the  multi¬ 
plication  loop,  the  factor  is  multiplied  by  0.1  a  sufficient  number  of 
times  to  bring  it  below  10*"\  By  keeping  track  of  the  number  of  times 
this  is  done,  we  can  re- insert  the  factor  into  the  final  result,  or,  if 
this  would  exceed  capacity,  print  the  factor  separately. 

The  final  form  of  the  computer  program  used  in  the  numerical  investi¬ 
gation  of  the  bound  of  inequality  (2.9)  is  given  in  the  Appendix.  Included 
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in  the  Appendix  are  a  brief  discussion,  including  a  flow-chart  and 
sample  input  sheet,  and  a  complete  listing  of  the  FORTRAN  statements 
of  the  program.  The  program,  in  its  present  form, has  enabled  us  to  in¬ 
vestigate  quite  efficiently  the  lower  bound  of  Inequality  (2.14)  for 
several  values  of  n  and  several  values  of  or.  The  next  section  is  con¬ 
cerned  with  the  results  of  this  investigation. 


We  note  that  some  possible  improvements  for  general  search  tech¬ 
niques  in  two  (or  more)  dimensions  are  suggested  by  experience  in  thlB 
problem.  To  our  knowledge,  these  have  not  been  considered  in  the 
literature.  In  particular,  the  ptper  by  S.  Brooks  L^l,  on  which  this 
technique  was  based,  does  not  consider  them. 

The  difficulty  arises  in  the  arbitrary  choice  of  the  initial  step- 
size,  or  in  its  arbitrary  reduction  by  a  factor  of  0.6  when  a  search 
path  reaches  a  "dead  end"  (i.e.,  no  improvement  over  the  previous  maximum). 
It  will  be  recalled  that  this  reduction  is  effected  only  if  the  current 
gradient  of  the  path  is  less  than  an  assigned  constant.  However,  it  can 
happen  that  a  point  quite  close  to  the  maximum  is  reached,  and  the  next 
step  takes  us  away  from  this  maximum.  To  avoid  this,  it  is  suggested 
that  we  examine  the  function  values  vi«v2»v3»v4  f°r  c^e  8fUnPle  points  of 
the  experimental  design,  and  compare  them  with  the  value  v&  already 
attained. 

If  all  values  v^,  i  ■  1  to  4,  are  less  than  vo>  then  obviously  we 
must  have  a  local  maximum  in  the  vicinity  of  the  point  with  value  vq. 

There  is  no  point  in  taking  a  step  based  on  this  configuration,  since  we 


should  then  get  further  way  from  the  maximum*  The  design  should  thus 
be  reduced  in  size  until  at  least  one  point  has  value  v  exceeding  vq. 
Alternatively,  the  design  could  be  compressed  until  all  four  points  ex¬ 
ceed  a  fixed  fraction  (e.g.  0.9)  of  the  value  vq.  The  former  procedure 
was  found  to  be  effective  in  this  study. 

2.3  Numerical  Results 

A  number  of  runs  were  made  for  sample  sizes  1,  11,  21,  31,  41,  51, 

71,  91  and  131  and  fora  -  .25,  .5,  .51,  .60,  .75,  1.00,  1.25,  1.5,  2.0, 
3.0,  and  5.0.  As  indicated  above,  the  calculations  proceed  as  follows. 

For  each  Alpha- value,  a  library  deck  is  read  in.  This  provides  a  tabu¬ 
lation  of  values  of  tuo  functions,  g^  and  g^,  having  p,  h  as  arguments, 
for  values  of  p,  h  in  the  range  p  =  0.0(0.01)  1.50,  h  **  0.00(0.01)... 
without  limit.  The  calculations  for  various  sample  sizes  are  then  made, 
using  tabulated  function- values  where  available,  and  when  appropriate, 
and  computing  and  storing  them  when  they  are  not.  When  all  calculations 
for  this  value  of  a  are  complete,  the  tables  are  sorted  internally,  and 
a  new  library  deck  is  punched  out  on  line. 

Substantial  library  decks  have  been  accumulated  for  most  of  the 
parameter  values;  in  fact,  these  tabulations  cover  most  of  the  grid  points 
that  would  be  needed  for  any  sample  size  calculation  in  the  range  1  to  100, 
for  the  parameter  values  listed.  An  indication  of  the  saving  in  computer 
time  was  provided  by  comparing  runs  using  substantially  complete  library 
decks  with  those  where  no  prior  values  were  known.  A  rough  estimate  is 
that  run  time  is  cut  to  one  fifth  or  one  tenth  by  the  library  deck  feature. 

All  of  the  results  to  date  are  summarized  in  Table  2.1.  The  table 

2 

includes  the  maximizing  points  nh  and  p,  n  times  the  maximum  and,  for 
2/or 

a  '  2,  n  times  the  maximum.  For  completeness  the  table  also  gives 
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TABLE  2.1 


SUMMARY  Of  VARIANCE  BOUND  NUMERICAL  RESULTS 


a 


.25 


.5 


.51 


.60 


.75 


1.00 


*  (-x) 


n 

nh 

P 

2„ 

n  Var 

2  /a 

n  Var 

1 

0.8006 

0.540 

0.0757 

.0757 

11 

0.2756 

0.624 

0.  7573  (-7)* 

.1345 

21 

0.1036 

0.626 

0.3414  (-11) 

.0027  (-1) 

51 

0.0264 

0.656 

0.4217  (-22) 

.7420  (-12) 

91 

0.0036 

0.660 

0.2452  (-30) 

.1392  (-18) 

1 

0.8230 

0.592 

0.2960 

0.2960 

11 

0.5090 

0.764 

0.3430  (-2) 

0.4150 

21 

0.1395 

0.886 

0.2011  (-3) 

0.0887 

31 

0.1000 

0.970 

0.1568  (-5) 

0.1507  (-2) 

41 

0.0101 

0.999 

0.4039  (-5) 

0.6790  (-2) 

91 

0.00015 

0.999 

0.9489  (-9) 

0.7858  (-5) 

1 

0.8280 

0.589 

0.2798 

0.2798 

11 

0.5119 

0.775 

0.4312  (-2) 

0.4312 

91 

0.00051 

0.999 

0.1360  (-7) 

0.7906  (-4) 

1 

0.8434 

0.605 

0.3670 

0.3670 

11 

0.6504 

0.824 

0.0220 

0.5381 

91 

0.0749 

0.9988 

0.3356  (-3) 

0.1376 

1 

0.8475 

0.6325 

0.5221 

0.5221 

11 

1.0956 

0.8842 

0.1257 

0.6222 

21 

0.8481 

0.9423 

0.0802 

0.6112 

31 

0.7093 

0.9426 

0.0631 

0.6235 

41 

0.7786 

0.9849 

0.0525 

0.6246 

51 

0.2297 

0.9959 

0.0486 

0.6658 

91 

0.5550 

0.99898 

0.0322 

0.6520 

131 

0.4430 

0.99900 

0.0286 

0.7379 

1 

1.1101 

0.6761 

0.8120 

0.8120 

2 

1.3468 

0.8076 

0.7161 

0.7161 

4 

1.4745 

0.8993 

0.6780 

0.6780 

6 

1.5158 

0.9324 

0.6670 

0.6670 

11 

1.5521 

0.9630 

0.6578 

0.6578 

16 

1.5653 

0.9746 

0.6545 

0.6545 

21 

1.5722 

0.9806 

0.6528 

0.6528 

26 

1.5763 

0.9843 

0.6518 

0.6518 

31 

1.5792 

0.9869 

0.6511 

0.6511 

36 

1.5812 

0.9887 

0.6506 

0.6506 

41 

1.5827 

0.9901 

0.6502 

0.6502 

46 

1.5839 

0.9912 

0.6500 

0.6500 

51 

1.5849 

0.9920 

0.6497 

0.6497 

56 

1.5857 

0.9927 

0.6495 

0.6495 

61 

1.5863 

0.9933 

0.6494 

0.6494 

CO 

1.5936 

1.0000 

0.6476 

0.6476 

_  -X 

~x 

indicates  that  tabulated  value  is  to  be  multiplied  by  10 
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a 

1.25 


1.50 


2.00 


3.00 


5.00 


**  n 


TABLE 

2.1  (Continued) 

2 

2 /a 

n 

nh 

P 

n  Var 

u  Var _ 

1 

1.2396 

0.7103 

1.0981 

1.098 

11 

1.9500 

1.0000 

1.7246 

0.662 

21 

2.1370 

1.0075 

2.1034 

0.622 

31 

2.2417 

1.0081 

2.3963 

0.607 

41 

2.2726 

1.0006 

2.7089 

0.613 

51 

2.3000 

1 . 0000 

2.978 

0.618 

71 

2.5606 

1.0012 

3.3058 

0.601 

91 

2.6000 

1.0000 

3.6463 

0.600 

1 

1.3477 

0. 7409 

1.3784 

1.378 

11 

2.1931 

1.0158 

3.1642 

0.640 

21 

2.4735 

1.0189 

4.3750 

0.575 

31 

2.6792 

1.0170 

5.3629 

0.544 

41 

2.8511 

1.0198 

6.2372 

0.525 

51 

2.9558 

1.0144 

7.0321 

0.511 

71 

3.1779 

1.0117 

8.4537 

0.492 

1 

1.7470 

0.7969 

1.8821 

1.8821 

11 

2.4240 

1.0279 

6.8506 

0.629 

31 

3.0291 

1.0229 

14.2368 

0.448 

51 

3.4251 

1.0186 

20.644 

0.410 

91 

4.1235 

1.0061 

32.158 

0.353 

1 

2.3361 

0.8538 

2.7622 

2.7622** 

11 

2.4934 

1.0391 

15.8955 

1.445 

21 

2.9280 

1.0247 

27.5487 

1.310 

31 

3.2298 

1.0104 

38.6824 

1.248 

41 

3.4855 

1.0010 

49.3972 

1.205 

51 

3.5928 

1.0012 

60.2271 

1.181 

1 

2.9591 

0.9139 

4.45 

4.45 

11 

2.419 

1.022 

36.42 

3.31 

21 

2.611 

1.018 

66.79 

3.18 

Var  from  this  point  on. 
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values  tor  or  *  1  obtained  in  a  previous  study  [2  ].  Although  a  few 

entries  of  the  table  may  not  be  completely  accurate,  some  general 

patterns  are  apparent.  The  results  are  of  considerable  interest, 

although  not  all  are  as  had  been  expected. 

2/cr 

For  1—0  5  2,  the  bound  times  n  appears  to  be  approaching  a 

constant  as  n  increases.  For  a  <  1,  however,  no  such  general  conclusion 

is  apparent.  It  is  quite  evident  that  the  bound  is  of  smaller  order 

than  n  for  the  cases  run  witho  3  .6.  For  cr  ■  .75,  however,  the 

curve  again  appears  to  be  approaching  an  asymptote.  These  results  are 

2/cr 

shown  graphically  in  Figure  2.1,  where  n  times  the  bound  is  plotted 
for  a  5  2  and  n  times  the  bound  for  er  £  2.  Their  apparent  regularity  is 
an  interesting  feature  of  this  pattern  of  curves. 

Note  that  our  investigation  has  included  some  values  of  a  corre¬ 
sponding  to  the  regular  case.  For  Alpha  greater  than  2,  we  know  that 
the  Cramer-Rao  bound  exists  and  that  the  variance  of  the  maximum  likeli¬ 
hood  estimator  asymptotically  achieves  this  bound.  Thus  the  value  of  p 
for  which  the  maximum  is  attained  in  the  above  should  tend  to  1  as  the 
sample  size  increases. 

This  appears  to  be  happening  for  or  "3.0  and  5.0.  For  finite 
samples,  however,  the  Cramer-Rao  bound  is  uot  attained;  this  is  because 
the  criterion 

<2'16>  ,  A(a) 

da 

where  a  is  the  location  parameter  and  A(a)  is  any  function  of  a  alone, 
is  not  satisfied  for  the  Pearson  type  III  distribution.  (See  Kendall 
and  Stuart  [8,  Section  17.17].)  Hence  we  might  expect  to  do  better  than  the 
Cramer-Rao  bound  for  finite  samples;  the  values  given  in  Table  2.1  do, 
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Figure  2-1.  Variance  Bound  Times  li 


for  a  5  2 ,  Times  n  for  a  -  2. 


2  la 
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in  fact,  yield  such  bounds-  This  follows  since  (in  the  regular  case) 
the  Cramer-Rao  bound  is  simply  (a-2)/n  (recill  that  we  have  taken  £  •  1). 
For  a  *■  3  and  5,  respectively,  the  hound  times  n  becomes  simply  1  and  3. 
The  tabulated  values  exceed  these  in  both  cases. 

It  would  also  be  interesting  to  see  whether  these  improved  bounds 
come  close  to  the  actual  variance  for  maximum- likelihood  estimators  in 
the  regular  case.  This  would  establish  the  efficiency  of  these  esti¬ 
mators  for  finite  sample  siz.-'s. 

Some  runs  have  also  been  made  for  a  ■  1.5  with  p  set  equal  to  1. 

This  provides  a  comparison  with  the  Chapman- Robb ins  bound.  The  results 
of  this  study  are  given  in  Table  2.2.  It  is  interesting  that  there  is 
apparently  little  improvement  in  *he  bound  by  the  introduction  of  the 
variable  p  nnd  that  furthermore  as  u  — }  ®  the  two  bounds  appear  to  be 
identical.  (Recall  that  this  result  had  been  proven  only  for  a  ■  1.) 


TABLE  2.2 

COMPARISON  OF  MAXIMUM  VARIANCE  BOUNDS  ATTAINED 
WITH  p  SET  EQUAL  TO  1,  AND  WITH  p  UNRESTRICTED 


or  »  1.5 


n 

nh 

Unrestricted 

2 /a 

p  n  Var 

nh 

£_!1  1 

2/cr 

n  var 

1 

1.35 

0.741 

1.3784 

- 

11 

2.19 

1.015 

.6397 

2.1596 

.6386 

21 

2.47 

1.019 

.5743 

2.4244 

.5719 

31 

2.68 

1.107 

.5439 

2.6195 

.5404 

41 

2.85 

1.020 

.5245 

2.7556 

.5210 

51 

2.96 

1.014 

5113 

- 

- 

71 

3.178 

1.012 

.4930 

- 
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In  Figures  2.2  and  2.3  the  values  of  p  and  h  at  which  the  maximum 
Is  attained  are  shown  for  the  non-regular  and  regular  cases,  respectively. 
The  solid  lines  correspond  to  a  ■  0.25,  0.5,  0.6,  0.75,  1.00,  1.25,  and 
2*0,  In  Figure  2.2  and  to  cr  ■  3.0  and  5.0  In  Figure  2.3.  The  dashed 
lines  correspond  to  n  «  1,  11,  21,  31,  41  and  51.  (To  preserve  clarity, 
the  lines  for  sample  sizes  71,  91  and  131  have  not  been  drawn).  The 
scale  used  in  Figures  2.2  and  2.3  Is  a  logarithmic  one  on  which  1.03 
corresponds  to  unity  on  the  log  scale  and  each  decrement  of  0.01, 
reading  from  right  to  left  corresponds  to  equal  increment  on  the  log 
scale.  This  transformation  provided  greater  clarity  In  th®  region 
where  p  >  1. 

Not  all  the  points  plotted  correspond  to  cases  in  which  the  maximum 
variance  bound  has  been  very  accurately  obtained  (say  to  within  .000001). 
Most  of  them,  however,  are  quite  accurate.  In  a  few  cases,  in  which  it  is 
clear  that  we  are  nowhere  near  the  true  global  maximum,  the  point  has 
been  omitted.  For  example,  this  is  the  case  with  or  ■  1.25  and  n  ■  41 
and  51. 

The  curves  for  cr  ■  .25  and  .5  are  restricted  to  the  regions  p  5  2/3 
and  p  5  1,  respectively,  according  to  the  theory.  In  fact,  it  is  seen 
that  as  the  sample  size  tends  to  infinity,  the  (p,nh)  point  tends  to 
(.666...,0)  and  (1.0,0)  respectively.  In  the  region  .5  5a  5  1.0,  it 
is  conjectured  that  the  limit  points  occur  on  the  axis  p  *  1,  the  curves 
for  y  -  0.5,  0.6  and  0.75  suggesting  this.  It  is  known  that  the  curve  for 
y  *  1  tends  to  (1,1.5936),  again  a  point  on  the  p  ■  1  axis,  with  increasing 
n  [2 ,  Sect  ion  3.1.1]. 

For  1  i  a  5  2,  the  curves  extend  further  into  the  p  >  1  region,  attain 
a  stationary  point  for  p,  and  then  tend  asymptotically  to  the  p  ■  1  axis. 
For  y  >  2,  the  curves  initially  have  a  negative  gradient,  and  then 
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Figure  2.2 

Maximizing  Values,  (p,nh),  for  Variance  Bounds  in  the  Non-Regular  Case. 
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Figure 
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behave  as  in  the  case  1  5  or  £  2.  However,  this  family  of  curves  does 
not  conform  to  that  for  o  £  2.  There  is  thus  an  apparent  discont faulty 
at  o  «  2. 

The  dashed  curves  for  constant  sample  size  n  are  quite  consistent 
for  or  values  less  than  or  equal  to  2-  For  or  >  2,  the  points  for  n  ■  1 

fit  in  well.  The  points  for  sample  sizes  11,  21 . however,  do  not 

conform  to  the  main  family. 

Some  additional  numerical  work  along  the  above  lines  may  be  con¬ 
sidered.  In  any  such  additional  runs,  the  curves  of  Figure  2.2  can  be 
used  to  provide  quite  accurate  Initial  values  of  p  and  nh. 

2.4  Generalization  of  the  Bound 

Application  of  the  bound  of  inequality  (2.10)  to  the  Type  Ill 
distribution  is  analytically  quite  straightforward,  although  some 
additional  numerical  difficulties  can  be  anticipated.  We  consider  only 
the  case  r  ■  2. 

It  is  known  that  the  bound  of  order  2  will  be  an  improvement  over 
that  of  order  1,  i.e.  ,  over  the  bound  discussed  in  the  previous  section. 

We  shall  see  that,  in  addition,  most  terms  of  the  second  order  bound 
can  also  be  expressed  in  terms  of  the  integrals  g  given  in  equation  (2.15). 
Thus  the  bound  can  conceivably  be  Investigated  numerically  with  relatively 
considerably  less  programming  effort  than  was  required  in  the  original 
such  Investigation.  Whether  or  not  a  further  Investigation  of  this 
type  would  be  worthwhile  has  not  yet  been  detere’ ned.  This  could  be  an 
interesting  area  for  further  research.  Should  a  numerical  study  of  the  case 
r  ■  2  be  conducted  and  found  to  yield  results  suggesting  a  considerable 
improvement  in  the  bound,  values  of  r  in  excess  of  2  would  also  be  con¬ 


sidered. 


We  proceed  with  the  construction  of  the  bound  for  the  case  r  ■  2. 
The  second  order  bound  of  the  type  given  in  inequality  (2.10)  is 


(2.17)  V(t)  --  sup 

*- 1  *  *'2  ’ 11  *  ” 


•  2  2n 
h  /k 


PL-Cj  fp(x,a)+(c.-c  )fP(x,a+h)+c  fp(x,a+2h)]2 

J”0  — - - - ndXl 


where  f  it  the  Joint  density  of  each  following  a  Type  III  distri¬ 
bution,  k  is  given  in  equation  (2.13),  and  x  -  (x . .  ). 

1  n 

To  maximize  with  respect  to  the  c^'s,  we  u8e  the  fact  that  C2“l-c^ 
and  differentiate  with  respect  to  c^.  Write 


(2. 18) 


fy  "  [f(x,ftfv)]U. 


The  maximizing  value  of  c^  is  determined  from  the  equation 
(2.!0)  0  +  (2c1-l)£j  + 


+  <2cr1>£h  +  ( 1_  C  l'  *2^  (  ■  E0+2  f  h"  £2h)ndxr 


We  find 


(2-20) 


;  J---.r<£2h-£h><fS-;£^2h>tclti<ixt . 

/•••/  <£S-2£S+£2h>2f0lndxl 


The  integral  in  the  denominator  of  the  bound  (2.17)  therefore  becomes 

(2.21)  J...J[c2£2»+2c.;i-2ci)£JfP-2c1(l-ci)Ej£Ph+(l-2c1)2£2P 

-2(1-2C1)(1-C1)f^li+(1-C1>2£2p£-Wl 

’  A  J'---.f(£0P-4£S£h+2£S£2h44£hP-4£S£2h+£2E>£01,,d,Il 
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+  2CJ  •  •  ■J'(fOPfh-fOPf2h-2£h['+3fhPf2h-f2hP)tOJTId)‘l 

+  j'"'J'(f^-2fhPf2Vf2h)£Oln‘,*1 


/— J‘<fS'f2h)2£01,ldV 


{  J,"J’^f2h'fh^(f0'2fh'ft2h 


/...J(fJ-2fP+fPh)2f;1r.d,l 


(In  all  of  the  above,  h  Is  positive  and  the  limits  of  Integration  are 
a  <  <  00  for  i  =  1, . . .  ,n.  ) 

Equation  (2.21)  involves  six  basic  integrals, including  all  second 
degree  combinations  of  the  form  f  Pf  P t ,  where  v,v’  »  0,h,2h.  Except  for 
constants,  these  are 


v  v 


(2.22) 


CO 

J  ...I*  n(x  -a)(2p'1)(a'1)exp[-2:(2p-l)(x  -a)/0}  Ildx.  -  pV*(2(0f-2p-«+2) , 
a  w  a  i  i  l 


(2.23' 


CD  03  _  2p(Qf-l) 

II(x1-a-h) 


r  ...r 

a+h  a+h 


exp{  ^pECx^a-h)/^} 


ll(xj-a5  1exp[  -E(x1-a)/P} 


Ildx, 


n  nh/Pp"  «“  2p(ar-l).  .  2p-l,  x -(«-!)  -y , 

Pe  Jo"\f0y  (y  +  -  p  h)  e  *dy 

pnenh/^gn(h(2p.i)/p,2p,cr-l). 


“  CD  ...  .2p(ar-l) 


^2.24)  p  p  ri(x1-a-2h)‘'l'v'"  *;exp( -2pE(xi-a-2h)/@} 

a+2h  a+2h  ~  7~7  T77  ndxi 

Il(xi-a)  expl-E(xi-a)/pj 


pne2nh/0gn(2h(2p-l)/0 ,2p,o-l) , 


03  00 


(2-25)  J  .  f 

a+h  a+h 


n[(xi-a)(x1-a-h)]P(Qf"1)exp{-  ^  E(2xi-2a-h)} 
n(x1-a)Qf‘1  exp{-E(x^-a)/fj) 


Ildx, 
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UU  Oj  |  I  1 

"•To'*’^o  n^xi+h)P  ljti^  exp^’  F  ^(p-oo^+io+px^}  ndxi 

»  Pt‘ea-P)nh/eg\^  h,  T^,  (1-P)(a-I)j, 


(2.26)  (*“  j‘"  n[(xi-a-2h)(xi-a)]p^  exp{-  r-2  I(r..-a-h)) 

4+2:1*  *  "4+2 h  ^77“  5 - 5Idxi 

H(xi-a)  expt-E^-a)/^} 

-  pne2(l-p)nh/Pgn^2(2.P-1jii,  _2_,(l-p)(a-l)) 


and 


(2 - 2 7 )  J  ...J 

a+2h  a+2h 


n[(xi-a-h)(x1-a-2h)]p(Qf'1)exp{-  | E(2Xi-2a-3h)} 


n(x^-a)°  *  expC-E^-a)/^} 


-fldx. 


pne(2-P)nh/6  j*. .  Il^U^h)^  Ifcl  h)"  ^ 


pne(2-p)nh/egn^h  2^1^  p 2^1  ^ 


say,  waere 
(2.28) 


,  P  ^2py-2p-q+2 

p “v ipO 


Note  that  the  last  integral  is  defined  in  terms  of  a  new  special 
function,  namely, 

CD 

(2.29)  g  (b,a,c,d)  ■  J  [y(y+d)]aC(y+b)  Ce  ^dy. 

1  0 

All  other  integrals  involve  only  the  function  g.  For  increasing  r, 
similar  additional  special  functions  are  introduced.  The  exact  form 
of  these  has  not  been  investigated. 


ndxj^ 
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3.  APPROXIMATIONS  TO  THE  PITMAN  ESTIMATOR  FOR  THE 
TYPE  III  DISTRIBUTION 

Previous  results  had  led  to  the  conjecture  that  Che  mininwm  ob¬ 
servation,  Y^,  is  an  efficient  estimator  of  a,  or,  at  least,  is  "nearly 
efficient"  in  the  sense  that  the  order  of  magnitude  in  n  of  its  asymptotic 

variance  agrees  with  that  of  the  optimal  bound.  As  noted  previously, 

-2  fa 

he  variance  of  Y^  is  of  crder  n  .  The  numerical  results  given  in 

_2  fa 

the  previous  chapter  suggest  that  the  bound  investigated  is  ) 

only  for  1  -a  -  2.  The  unresolved  question  with  regard  to  the  remainder 
of  the  range  of  a  is  whether  the  lack  of  agreement  »n  order  of  magnitude 
is  due  to  inefficiency  of  the  estimator  or  sub- optimality  of  the  bound. 

(A  third  possibility,  of  course,  is  that  the  numerical  results  are 
anomalous.  Th^s  could  result,  for  example,  because  of  convergence  to  a 
local  maximum  which  is  orders  of  magnitude  smaller  than  the  global  maxi¬ 
mum.  There  is,  however,  no  evidence  to  support  such  a  conclusion.) 
Assuming  that  the  numerical  results  are  correct,  one  suspects,  since  the 
bound  ultimately  decreases  very  rapidly  with  increasing  n  for  or  <  1, 
that  the  difficulty  is  inherent  in  the  bound,  but  the  question  remains 
open.  In  any  case,  the  knowledge  that  Yj^  is  not  even  sufficient  other 
than  for  a  =  1,  along  with  the  possibility  that  it  is  inefficient  even 

with  respect  co  order  of  magnitude  in  n,  provides  motivation  for  an  in¬ 
vestigation  of  alternative  estimators.  Alternatives  of  the  type  suggested 
by  the  work  of  Pitman  [ll]  are  the  subject  of  this  chapter. 

3 . I  Pitman's  Estimation  Technique 

The  estimator  introduced  by  E.  J.  G.  Pitman  [11]  is  "optimal" 
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according  to  several  criteria  of  optimality  and  under  quite  general  con¬ 
ditions.  Pitman  proved  that  it  is  unbiased,  minimum  variance  among  all 
invariant  estimators  and  it  has  been  shown  by  Stein  C 12 3  to  be  admissible 
under  mean-square-error  loss.  This  estimator  is  therefore  "best"  under 
almost  any  reasonable  definition  of  the  terra. 

The  basic  Pitman  method,  in  general,  is  as  follows.  Let  X, » • • • ,  X 

i  n 

be  independent  and  identically  distributed  random  variables  with  dis¬ 
tribution  function  of  the  form  F(x-9)  admitting  of  a  density  F'(x)  =  f(x). 
Suppose  f(x)  ■  0  for  x  <  9.  Let  Y^  §  Y^  =...=  Y^  be  the  corresponding 
order  statistics. 

The  Pitman  estimator  is 


(3.1) 


<p(X. , . . .  ,X  ) 
i  n 


L  W(¥9> d9 

00 

J*  S  f (X.-9)  d9 

-«  i=i  i 


Note  that,  because  of  the  assumption  X,  >  9,  the  limits  on  the  integrals 
in  equation  (3.1)  actually  extend  only  to  the  minimum  observation,  Y^. 
Furthermore,  this  expression  can  be  written  in  terms  of  the  Y^  as  well, 
namely 

L 6  iif<ve> de 


(3.2) 


9  (Yj, 


J  1  n  f(Y  -9)  d9 


Substitution  of  the  Pearson  Type  III  distribution  into  equation  (3.1)  or 
(3.2)  yields  integrals  which  cannot  be  expressed  in  closed  form.  An 
approximation  to  the  estimator  must  therefore  be  constructed  in  order  to 
pursue  the  analytical  investigation  in  this  case.  The  remainder  of  this 
chapter  will  be  devoted  to  a  series  of  approximations  based  upon  an 

alternate  representation  of  the  Pitman  estimator  for  densities  bounded 
from  below. 
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3 . 2  Approximations;  Application  to  the  Type  III  Distribution 

An  alternative  representation  of  the  Pitman  estimator  which  immediately 
suggests  a  relatively  simple  approximation  is  obtained  as  follows.  Let 
0  -  Y^  -  X  and  for  i  *  2,...,  n.  In  terms  of  these  vari¬ 

ables,  the  estimator  becomes 


(3.3) 


WZ2” 


’V 


Jn(Y1-X)f(A)  ^f^+X)  dX 

CO 

f  f(X)  ft  f(Z  +X)  d\ 
i-2  L 


Xf(X)  ft  f (Z ,+X )  dX 

i=2  i 


J  f (x)  n  f (z  +X)  dx 

0  i-2 


=  y.  -  e{y,  Iz  . z  ,e  =  o}. 

1  1 '  2  n 

Thus  the  estimator  can  be  expressed  as  the  difference  between  Y^  and  he 
regression  of  Y^  on  Z^, . . . ,Z^.  The  essence  of  the  approximation  to  be 
developed  below  is  to  restrict  consideration  to  a  fixed  number  m  <  n 
of  the  Z^  and  to  use,  instead  of  the  above,  the  estimator 


(3.4)  9  =  Y  -  E  {Y.|Z  ,...,Z  }, 

l  12  m 

where  E  {y,  |z^,.  ..,Z  }  is  the  best  linear  regression  of  Y,  on  Zn,...,Z  . 

1.  Z  m  -  12  m 

We  propose  to  investigate  the  asymptotic  properties  of  estimators  of 

this  form  for  the  parameter  a  in  the  Pearson  Type  III  distribution. 

Without  loss  of  generality  we  may  take 

o-l 


(3.5) 


f(x) 


X  -x 

r »  e 


x  *  0 


-  0 


x  <  0. 
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The  determination  of  E  requires  a  knowledge  of  the  first  two  moments  and 

the  second  order  cross  moments  of  Y. . Y  •  Since  the  exact  forms  of 

1  m 

these  are  quite  complex,  we  again  seek  approximations.  Approximations 
to  these  moments  for  large  n  (and  fixed  m)  are  determined  as  follows. 

We  may  write 

(3.6)  -  F'1(Ui), 

where  U,  5  U,  5,.,5  U  are  order  statistics  from  a  uniform  distribution 
1  l  n 

on  (0,1).  Since 

(3*7)  F(x)  «  r (a  j  J* o  c  dt» 

we  have,  for  x  sufficiently  small  so  that  e  X  =  1, 

(3  V  F_1(u)  =  [ar<cr)u]1/C* 

-  [uTfer+l)]1/Q'. 

It  is  easily  seen  that  the  density,  say  h^,  of  is 

(3.9)  h£(u)  -  i©U1“1(l-u)n'i  0  <  u  <  1 

■  0  otherwise. 


It  follows  that 

(3.10)  EYir  -  ECUir(a+l)]r/Q' 


■  [r  (o+i)] 


J)/1 
J  0 


r(i)  r (n- i+1)  r(nfl+x/df) 


To  facilitate  the  ensuing  calculations,  it  is  convenient  to  make 
one  final  approximation.  Using  Stirling's  Formula,  we  have,  for  >«ge  n, 
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(3.11)  l°g  r(^f§j  “  <n+b'1)  ■  +  (nfc-l)  log(nfc-l) 


-  (nfb-1)  log(nfb-l)  f  (l/2)log(nfc-l) 

-  (l/2)log(i»fb-l) 

-  b  -  c  +  (n-l)log(~^)  f  c  log(nfc- 1) 

-  b  log(nfb-l)  +(l/2)log(^i) 

-  b  -  c  f  (n- 1/2)  log  (l+  +  c  logC*^'1) 

-  b  log(nfb-l) 

-  b  -  c  +  (n-1/2)  +  c  log(nfc-l)  -  b  log(nfb-l) 


c  log(nfc-l)  -  b  log(nfb-l). 


Thus 

(3.12) 


F  (nfc)  •  (nfc-l)C 

r(ttfb)  "  . b 


(nfb-1) 

Applying  this  to  the  right-hand  side  of  equation  (3.10),  we  obtain 
r  •  [r(afl)]r/a  r (if r/a) 


(3.13) 


EY . 


n 


r(i) 


(nfr/cr) 


If  r/a 


T ( if  r/a) 

r  (i) 


.(TJ^ 

Similarly,  since,  for  i  <  j,  and  have  joint  density 
(3.14)  b  (u,v)« 


ij' 


(i-l).'(j-i-l).'(n-j).‘ 


i-1  ,  J-i-1.,  Nn-j 

U  (v-u)J  (1-v)  , 


we  find  that,  for  i,J  small  with  respect  to  n, 


(3.15)  EY.Y 


n.'[r  (orfl)] 


2  /a 


1 


1  j  (i-l)'(j-i-l)'  (n- J )  ■'  0  0 


r  r '  „i-ifi/of  i/a ,  vj-i-i 

7  J  J  u  v  (v-u)J 

1  r\ 


(l-v)n  ^  dudv 
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Making  the  substitution  u  ■  tv  in  this  expression,  we  obtain 

/  -i  j  n  p-v  v  m  “•>-*  v*'1/-!  f  T  j-l+2/or  .  .n-J  i-l+l/or  .J-i-1. 

(J.16)  EY.Y,  *  , .  - rrr  J  J  VJ  (1-v)  ’t  (l-t)J  dtdv 


nl[r  (Qf+1)]‘ 

'l‘J  (i-i).’(j-i-i):(n-j)i  >J0  JQ 


n.'[r(o4-i)]2/cy  r(j-f2/or>  r(n-j+i)  r(i+i/o)r(j-i) 
(l-l).'(J-i-l)!(n-J)!  '  T  (nf  1+2/tf)  r(j+l/o) 


.l£i2±Ui_L  rm+n  rq+2/o)  rq+i/ar) 
r  ( i )  r(nfi+5/a)  rq+i/o) 

if  Ue+i)Y/a  rq+i/o)  r(j+2/a)  . 

V  n  /  T(l)  r<)+lA») 


It  follows  from  equation  (3.13)  that 

(3. 


,17)  nV  -  (Vf*)2'°  , 

and  from  equations  (3-13)  and  (3.15)  that,  for  i  <  ), 

\2/a 


(3.18)  Cov  (Y 


rV  - 


rq-Hi/a) , 

r  ( i) 


fm±2 m  .  m-fi/Qr) 
Lfo+i/“)  r(j)  _  * 


To  determine  the  best  linear  regression  of  Y.  on  Z» . Z  ,  we 

jL  2  m 


minimize  the  quantity  e{y  -c,  -  E-  c.  Z,}  ,  c,  being  the  constant  and 

M  '  1  In  i»2  im  i  lm 

c„  . . . . ,c  the  coefficients  of  Z„,...,Z  ,  respectively,  in  the  mth 
2m  mm  2  m 

order  approximation.  Thus  we  determine 

O.H)  inf  ElV  Clm  -j2  c^]2  -  inf  eIy,  -  «,.  -  -  V,)} 

/»  r* 


C  .  }  •  •  •  »  c 
lm  nm 


Clm”*  '  ,Cmm 


m 


■  lnE 

Cl» . C- 


ro  y 

-  c  -  Ec  Y  }  . 
lm  2  i 


mm 


Equating  partial  derivatives  to  zero,  we  obtain 
(3.20) 


m 


m 


0  -  c.  ■  (ll  E  c  )EY,  +  E  c  EY.  , 
im  i*2  im  1  ^  im  i 
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for  J  ■  2...,  m.  With  the  notation 

(3.24)  Vi  "  Vii  "  V(V’ 

(3.25)  V  '  Cov(Yt.Yj. 
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equation  (3.23)  becomes 


(3-26)  v  1  +  E  c  ;  I  V  -V  j  +  £  c  (v  -V  )  -  0 

v  ^  L2  ^  l  1  ijJ  i-2  lmv  l)  vli 


(3-27>  &  ci«(vrvij +  V  V  ■  vu  ■  vi 


A  matrix  representation  of  the  general  solution  of  this  system  of 
equations  is  quite  straightforward.  Mote  that 


(3.28)  Vi  -  +  VtJ  -  Vn  -  E(Yi  ’  Vj  +  Vj  *  VP 


-  [  (EYj  )2  -  EY^  +  EY^  -  EY^EY J 


-  E(Y1-Yi)(Y1-Yj)  -  E(Y1-Yi)  EfY^Yj) 


Cov(Y1“Y^  .Y^Yj  ) 


Uij  , 


say.  Thus  the  system  of  equations  (3.27)  can  be  written 


(3.29) 


£  c  u  -  v  -  v 

1-2  in:  ij  lj  1 


>ay,  where  X^  -  -  V^,  or,  in  matrix  notation,  as 


(3.30)  Ac  -  X, 

mm 


where  A  is  the  (m-l)x(a-l)  matrix  with  elements  U..,  c  is  the  vector 
m  i  J  m 

of  c  's,  and  X  is  the  vector  of  X  's-  Thus  the  matrix  representation 
in  j 
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of  the  general  solution  is 


(3.31) 


c  =  A  ^  A . 
m  m 


An  explicit  solution  of  this  equation,  expressing  the  c,  as  functions 

lm 

of  o,  has  not  been  obtained  for  general  ra.  It  is  interesting  to  note 

2/a 

that  for  the  Type  III  distribution,  since  the  quantity  (r(a+l)/n) 


factors  out  of  each  and  this  is  the  only  function  of  n  involved  in 
V^,  the  vector  c^  is  independent  of  n.  We  next  consider  the  explicit 
results  for  m  =  2,  3  and  4. 


3.3  Approximations  of  Small  Order 

For  small  m,  it  is  possible  to  express  the  c^m  explicitly  as 
functions  of  n  and  O'.  We  begin  with  the  case  m  ■>  2.  For  m  *»  2,  equation 
(3.23)  yields 


(3-32) 


-EY12  +  (EY^)2  +  EYjY2  -  EY1EY2 
22  EYj  +  EY22  -  (EY1)2  +  2EY1EY2  -  2EY1Y2  -(EY^2 


-V1  +  V12 
V1  +  V2  -  2V12 


where  the  notation  is  as  in  equations  (3.24)  and  (3.25).  Thus,  from 
equations  (3.17)  and  (3.18), 

(3.33)  c22  =  {-  r(  1+2/a)  +  r2(  1+1/a)  +  r(l+l/a)  -  r(2+l/<y)]  }  -r 

( 1+2 /a )  -  r2(l+l/<y)  +  T(2+2/a)  -  r2(2+l/a) 

-  ?r(i+i/cv)  -  r(2+i/a)]} 

■  ( l+2/a)  +  r2(l+l/a)  r (2+2 /a)  -T  (1+1/a)  T(2+l/a)]  ~ 
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I2  r<1+2/Qf)  -  C2+l  +  h)  r2(i+/tt)  -  21  ^  r < 2+2 /or > 

a 

+  2T(  1+1/a)  r  (2+1/a)} 

^  r(l+2/3-)  -  r2(l+l/a) 

-^-rd+2/a)  -  ^  r2(i+i/of) 
and,  from  these  and  equation  (3.22), 

(3.34)  c12  -  ^rci+i/o)  +  c22  /C,r<i+i/») 

-  (?^)1AVH 

"(Pn*^)  /  r(  1+1/a)  [1+c22v1"(1+1/q'Vj 

-i^±^l/0fr<1+1/a)(1  - 

For  m  =  3,  the  solution  of  the  system  of  equations  (3.27)  is 

c23  -■53r<V12-V<VY2V13>  •  <V13-V1>(V1+V23-V12-V13>] 

(3.35,  -  ^3(VU-V1)  -  V23(VU-V1)  +  V13(V13-Vl2)] 

c33  '  ■53[V2<V13-V1>  -  V23<V12-V1)  +  V12 <V12' V13 >]  • 
where 

(3.36)  D3  -  (V1+V2-2V12)(V1+V3-2V13)  -  <VV12+V23_V13)2* 

To  express  these  results  explicitly  as  functions  of  a ,  we  use  the 
notation  ■  f(l+i/a),  and  kn»  f.r(ar+l)/n]  .  We  find,  from  previous 
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results,  that 

V1  ■  kn2[r2  -  Fl2] 


v,  -  k  r,  -  r,2] 

2  n  L  or  2  a  *■  1  J 

<3.37,  V3  -  ^majaa  r2  r32] 

v  .  k  r  -  r.2l 

V12  n  L(of+l)  2  a  lJ 

u  _  i»  2f  2(of+2)  r  .  <2or+l)(tt,t.U  r  21 
V13  kn  L  20f+l  1  2  ^2  1  J 

.2|  2(<H-l)(a+2)~  .  (o>+l?2(ay±l)  r  2"] 

V23  "•  "n  L  « (toil)  *  2  *3  1  J  • 

Note  that,  from  equations  (3.37), 

v  v 
v23  a  li 

(3.38)  V2  - V12 

„  „<££±Ui2Z±il  V  .  -  V... 


'3  2 

J  7a 


13  “  7a  23 


Thus 


D3c23  "  V3(V17.-V1)  '  V32(V13-V1)  +  V13<V13'V12> 

.  (s±me+ll  v  13(v12-v,)  -  +  v13(v13-»12) 

2a 

.  Vj3  [i2±Ul^±li  (V12-v1,  -  seal  (V13-V1)  +  (V13-»12)] 

2ct 
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(3.39)  -  k  2v  !  ,ll!z±±l  ,  £fr+l)(or+2)  a+1  r  r-(*(+l){a+n 

n  13l  <o(2>  +  l)  2  '  2  J  2  *  ' - \ - 

2a  (a+1)  2a  2  ^3 


.  i3«±4i«±n  .  ss±n  ;r  n 

■  2a2  J  1  J 


k  V  r_^-iL_r  1 

n  13L  2a2(a+l)(2a+l)  "J 


k  V-  r  ,2; 

n  2l  a~(a+i)(2>  fl)2  2  '-H  1  J 


4a 


Similarly, 


(3.40)  d3c  *  k  V  j — — ^r2-' — r,  -  r  2  . 

3  33  n  2~a(a+lr(Z>+l)  "  2a  1 


To  express  D.  in  terms  of  a  and  the  f . ,  we  determine 
3  l 


a+1 


(3.  »,+V2-2»l2"V1+vVu-2V1J-V1-^ 


V.  -SZlil 


n 


n 


La  (a+1)  F2  "  J?  1  1  j 


(a+1) 

1  p-  2 


Similarly, 


(3.42)  Vl  +  V3 


»  k  2 1  _j*±L — f 
n  a  *  ( 2a + 1 )  2 


(3a+l)' 


and 


(3.43)  V, 


V12  + 


23 


5a+4 


a  (a f  1)  (2a+l)  2  ^ 


3a+l  -  21 
3  l  J' 


Thus , 
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(3.44)  D 


Hence 

(3.45) 

and 

(3.46) 

Finally, 

(3.47) 


3  ■  (VY2V12><VV2V13)  -  'WW 


k‘{  r  ziSiai. 

®  ^  i  1  \  /  <Vv  . 


l^L 


gJ(a+l)(2a+l)  g2(a+l)2(2»+l)2j2 


'  J  i 


ri2& 

L  *vv 


^+33y2+13g+l)  _  2(15g2-Hfe44)  \  r  2 

»5(o+1)(2>+1)  2a4(a+l)(to+ir  2  1 


(3or+l) 
T 


4a 


v  M~fr+2)(3*  +4a+3)  r  2 

Hw<w  2 


(0+2)  _  i  2 

c23  =  - ««,-!)  -^npu!l£^L_ 
(o+2)(3tt2+4>+2)  g+1  r  2 

2  2  1  9 - 9  1 

(g+1)  (2»+l>  2  3*2 


g+2 _ P  _  1  p  2 

2  1 9  2  1 

(g+1)  (2a+l)  2a 

(a+2)(3ar2+4g+2)  r  g+1  r  2 

2  2  2  2  *  4  1 

g  (g+ir(2*+l)  2a 


C13  =  (1'K23+C33)  ^1  '  ^C23EY2  "  C33EY3) 


k  r,  +  [k  r,  -  k  (i+-)r.l  c  +  [k  r  -k  (2+i)(i-+i>r 

nl  Lnl  n  »  1J  2J  Inin  a  a 


k  r  (1  -  -  c  -  ■^r— -  c  ) 
n  1U  O  23  _  2  33; 

2 a 


(a-l)(a- 


n  1  L  a2(a+l)2(2a+l)2  a2(a+l)(2a+l)2 


(a+2)(3a+l)  A 
2a2(a+l)2(2a+l)  2 


.  f". £±k  +  a-l  .  3a+l  ,  r  2 

v-  ^  1 


~(a+2)(3tt  +4a+2)  r  .  a+1  r  21 
La2(a+l)2(2a+l)2  2  2a4  1  ^ 


- 9[±2 — _  k  p  p 

2(a+l)  (2a+l)  K1T2 


(a+2)  (3a  +4a+2)  ^  a+1  r  2 

2  2  1  2  2  1 

(a+ir(2a+l)  2a 


Similar  tedious  algebra  leads,  for  the  case  m  =  4,  to 


(3.48)  c,  .  -  afcll  r  -  Slj  r/]. 

24  A  L(2a+l)2(3a+l)2  12a3 


(3.49) 


-1)  f  (a+2)(3a+2)  r  _  _J_  2  1  =  _a 

i  L  (a+ 1 )  ( 2a+l ) 2  ( 3a+ 1 ) 2  2  1>Vv3  1  J  <*+ 


a+1  24 


l!  f  a+2  „  1  _  2“ 

(3.50)  c  « ~  - 2 - r  -  -~r,  , 

44  A  L (a+1)  (2a+l)  (3a+l)  2  12y3  1  J 


(3.51)  c 


k  r  r  (a+2) 
n  I  l 

a  '  ■■  '  ■ 

14  2(2a+l)  (3a+l)A 


where 

,,  .  „  2(»+2)0>2+3°r+2)  r  .  jrWji  r  2 

(J.52)  A  2*2  3 

(a+l)(2a+l)(3a+l)  *  12a 
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The  explicit  form  of  the  general  solution  is  not  apparent  from 

these  results.  It  is  interesting  to  note,  however,  that  for  the  case 

a  *»  1,  the  above  reduce  tofc  =  —  »  c,  and  c.  “0  for  i  £  2  and  all  m. 

ti  n  in  lm 

Thus,  in  spite  of  all  the  approximations,  the  estimator  reduces  to 
*  1 

a  =  Y,  -  — ,  which  is  known  to  be  the  best  unbiased  estimator  for  the 
1  n 

case  Of  «  1. 

3.4.  Comparison  of  the  Estimators 

The  improvement  attained  (asymptotically)  by  introducing  order 
statistics  other  than  into  the  estimation  procedure  can  be  assessed 
by  comparing  asymptotic  variances.  The  asymptotic  variance  of  the 

•jf 

estimator  for  general  m,  say  V^,  follows  readily  from  the  above  results. 

We  have,  with  Z  the  vector  of  Z's, 

— m  l 

(3.53)  Vj  -  V  (j,  -clm  -  j^CVV) 

-  v«i  - V 

-  V(V  ■  2  C°v  «l-°;  4»>  +  V(cm  2m> 

-  '(  *  K  Cov  <W  +  ci  V^»)cn, 
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(3.54) 


V  -  V*  -  X'  A'1  X. 
1  m  m 


The  right-hand  side  of  equation  (3.54)  is  always  positive  (or  t  1). 

Furthermore,  since  on  the  right-hand  aide  of  equation  (3.53)  the  quantity 
2  2./ct 

kn  ■(T(or+l)/n)  factors  out  of  both  terms,  it  is  clear  that,  so  long 

as  m  is  not  a  function  of  n,  the  order  of  magnitude  of  the  variance  of 

*  2 

the  estimator  involving  terms  up  to  order  m  remains  n  .  The  question 

-2  la 

as  to  whether  the  variance  remains  mn  *'")  when  m  increases  with  n 

(for  example,  m  ■  n  )  remains  open. 

A  small  numerical  study  of  the  improvement  in  the  variance  (using 

the  above  asymptotic  results)  by  the  introduction  of  higher  order  terms 

has  been  conducted.  Note  that,  given  the  c^,  the  numerical  calculation 

of  V*  is  most  conveniently  performed  by  use  of  the  relation 
m 

(3.55)  V*  -  V,  -  X'A-1  X 

v  m  1  m 


V,  -  X'c 
1  m 


m 


vi  wv- 


The  results,  for  m  «  2,  3  and  4  and  or  «=  .25,  .50,  .75,  1.50,  2.00  and 

3.00,  are  given  in  Tables  3.1  and  3.2.  Table  3.1  gives  the  c^m  and 

2 /a 

Table  3.Z  n  times  the  asymptotic  variance  of  the  approximations  to 

the  Pitman  estimators  and  the  asymptotic  efficiencies  relative  to  Y^- 

(Note  that  n1^*  c,  are  tabulated.  As  noted  previously,  the  remaining 
lm 

c,  are  independent  of  n.) 
lm 

Some  of  the  results  of  Tables  3.1  and  3.2  indicate  a  number  of 
potentially  fruitful  topics  for  further  investigation,  both  analytical 
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43 


(-x)  Indicates  that  tabulated  value  is  to  be  nultiplied  by  10 


TABLE  3.2 


Asymptotic  Variances  and  Relative  Efficiencies 
of  the  Approximations  to  the  Pitnen  Estimator  for  the 
Pearson  Type  III  Distribution 


Of 

.25 

.50 

.75 

1.50 

2.00 

3.00 

2/o  * 
n  VL 

18,107 

12.337 

2.0717 

.54913 

.42920 

.34780 

n2/“v* 

16,464 

11.515 

2.0357 

.52505 

.37775 

.26189 

»2A,v; 

16,18* 

11.292 

2.0224 

.51278 

.35168 

. 22259 

2/a  * 
n  V4 

16, 11? 

11.205 

2.0159 

.50501 

.3350? 

. 19882 

* ,  * 
VV2 

1.100 

1.071 

1.018 

1.046 

1.136 

1.328 

*  .  it 

VV3 

1.119 

1.093 

1.024 

1.071 

1.220 

1.563 

*  * 

V\ 

1.124 

1.101 

1.028 

1.087 

1.281 

1.749 
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and  numerical.  It  is  interesting  to  note,  for  example,  that  the 
apparently  converge  quite  rapidly  for  fixed  i  as  n>  increases  (e.g., 
for  a  »  .5,  c22  -  .1667,  c23  -  .07627  and  c^  -  .07569),  and  that,  for 
another  example,  in  the  non- regular  case  the  minimum  observation  appar¬ 
ently  contains  the  most  significant  information  relative  to  the  location 
parameter  (the  asymptotic  variance  decreases  relatively  slowly  as 
additional  observations  are  introduced).  We  plan  to  investigate  these 
aspects  of  the  problem  more  thoroughly  in  future  research.  It  is  not 
surprising,  incidentally,  that,  in  the  regular  case  Investigated  (a  *3), 
the  efficiency  increases  more  rapidly  as  additional  observations  are 
introduced  since  the  maximum  likelihood  estimator,  which  is  asymptotically 
efficient  in  this  case,  is  a  function  of  all  of  the  observations. 

Other  aspects  of  the  problem  of  interest  for  further  investigation, 
particularly  in  the  non-regular  case,  include  the  small-sample  properties 
of  the  approximation  to  the  Pitman  estimator  and  a  comparison  of  the 
approximations  with  the  exact  Pitman  estimator.  Some  Monte  Carlo  studies 
of  these  aspects  of  the  problem  are  anticipated. 

A  very  interesting  and  difficult  additional  topic  for  further  in¬ 
vestigation  is  the  problem  of  estimating  a  when  the  remaining  parameters, 
a  and  (3,  are  unknown.  Because  of  the  relative  rates  of  convergence,  it 
is  by  no  means  clear  that  a  Pitman-type  estimator  such  as  the  above  can 
be  constructed  in  the  non-regular  case  when  all  parameters  are  unknown. 

We  plan  to  pursue  this  aspect,  as  well,  in  future  investigations.  The 
apparently  pathological  case  of  or  exactly  equal  to  2  is  an  additional 

challenge  of,  at  least,  academic  interest. 

Finally,  we  plan  to  investigate  approximations  other  than  the  linear  one 
discussed  above.  The  objective  of  such  an  investigation  would  be  the  deri¬ 
vation  of  an  estimator  which  converges  to  (i.e.,  is  asymptotically  equiva¬ 
lent  to)  the  exact  Pitman  estimator. 
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4.  APPLICATION  OF  THE  APPROXIMATIONS  OF  THE 
PITMAN  ESTIMATOR  TO  THE  WEIBULL  DISTRIBUTION 


For  Che  Weibuil  distribution, 


(4.1) 


f(x)  -  K(x-a)*"1  e'<X'a) 


x  >  a 


otherwise , 


the  analysis  of  the  Pitman-type  estimator  is  very  similar  to  that  given 
above.  Equation  (3  31),  in  fact,  provides  a  general  solution  for  the 
c  *  It  restains  to  express  these  explicitly  as  functions  of  the  Weibuil 

LIB 

shape  parameter,  K.  Because  of  the  nature  of  the  two  distributions,  these 
can  be  expected  to  be  of  the  same  general  form  as  in  the  Type  III  case. 

For  the  Weibuil  distribution,  however,  since  the  distribution  function 
can  be  expressed  in  closed  form,  the  distribution  of,  and,  in  fact,  the 
moments  of,  the  order  statistics  can  be  determined  explicitly  for  small 
sample  sixes.  Thus  approximations  of  the  type  given  in  equations  (3.8) 
to  (3.18)  are  not  necessary  in  the  Weibuil  case. 

The  moments  of  the  Weibuil  distribution  have  been  derived  by 


Lleblein  L 10] .  The  rth  moment  of  the  ith  order  statistic  is 


l-l 

(4.2)  El/  -  i(“)r(l+;r  )  L  (-1)V  (V/  (iH-v-i+1) 


-l-r/K 


The  required  cross-moments,  with  i  <  J,  are 

(4.3)  EY^  -  (i-l).,(j-i-l)!(n-j)!  v^-0  i^-O  (1)  0^)0^  ' 

t(J-i+vl-v2,n- )+v  +1), 


where 
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1+1  /k 


(A. 4) 


*(s.t)  - 


^st 


V 


rv2  *1/ V2+i  l+iy 


p  ■  s/(s+t),  and  B  (.,.)  is  the  incomplete  Beta- funct ion. 

P 

We  shall  consider  the  linear  approximations  of  the  form  given  in 

the  previous  chapter  with  m  ■  2  and  m  -  3.  The  coefficients  c.  are 

lm 

expressed  in  terms  of  the  variances  and  covariances,  given,  in  the 

notation  of  the  previous  chapter,  but  with  B  ■  B  (1  +  — ,  1  +  —  )  and  f 

P  P'  K  K'  J 

Td+J/K),  by 


(4.5) 


■2/K  r  r  2 


(4.6) 


V2  -  i  n(n-l) 


-2/K 


-2/1 


(n-l)n  Jf2  -^n(n-l) 


-1/K  .  -1/K  >  2 

-  (n-l)n  j  11  , 


(4.7) 


(4.8) 


(4. 9> 


and 

(4.10) 


V3  -  \  [n(n-l)(n-2)‘2/K  -  2n(n- l)'2/K(n-2)  n‘2/K(n- 1) (n-2)  J 

-  ±  [n(n-l)(n-2)"1/K  -  2n(n- if  1/K(n-2)  +  n‘ 1/K(n- 1) (n-2)j2T  * , 

V12  -  ( 1+2 /K)  n  (n-l)'1/K  r2Bx  -  «* l/K  [n(n- 1)’ 1/K-n‘ 1/K(n- 1)  |T  * , 

n 

V13  "  2  (l+2/F)  "  [(n-l)(2n-4)*1/KB2  -  2(n- 1)" 1/K(n-2)31  jlf 

n  n 

*  2  n_1/I  [n(n-l)(n-2)'1/K-2n(n-l)'1/K(n-2)  +  n' 1/K(n- 1) (n-2)^ l , 


~  (1+2/K)  n  (n-1) 


-  |  n(n-l) 


1/K 


7\l/K  I.  /  v-l/K 

2(n-2)  B  j  -  (*.n-4)  x  2 

n^l  n 

n'1/K(n-l)j  |  n(n-l)(n-2)‘1/K-2n(n-l)'1/K(n-2) 
+  n  ^K(n- 1)  (n-2)  T  ^ . 
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The  c  are  now  obtained  by  substituting  these  results  into  equation. 

Lm 

(3.22)  and  (3.31).  Note  that  the  solutions  for  the  c  are  considerably 

more  complex  when  the  exact  variances  and  covariances  given  in  equations 

(4.5)  to  (4.10)  are  employed.  Furthermore ,  the  c ^  will  be  functions  of 

n  for  all  i.  Nonetheless,  a  small  numerical  study,  including  sr-.cral 

values  of  K  and  n,  has  been  conducted.  The  results  are  given  in  Table 

4.1.  The  numerical  evidence  suggests  that  as  n-»®  the  c  for  the 

ID 

Weibull  distribution  converge,  for  i  >  1,  to  those  computed  for  the 

Foarson  Type  III  utilizing  asymptotic  moments.  This,  of  course,  is  to  be 

expected  since  the  asymptotic  distributions  of  order  statistics  from  the 

Weibull  and  Type  III  distributions  are  identical  except  for  a  constant 

factor  which  cancels  out  in  the  derivation  of  the  c.  for  i  >  1.  It  is 

la 

interesting  to  note  that  the  convergence  is  apparently  moderately  rapid 
for  the  values  of  K  investigated.  Some  further  numerical  work  on  the 
Weibull  ca^e  is  anticipated.  In  particular,  a  tabulation  of  the  coefi- 
cienis  c  would  be  of  interest.  This  is  especially  true  of  the  case  a  ■  n. 

1ST! 

although  this  case  would  involve  a  substantial  amount  of  calculation  ex¬ 
cept  for  quite  small  n.  Small  sample  properties  of  the  approximations 
and  a  comparison  with  the  exact  Pitman  estimator  would  also  be  of  interest. 
Finally,  as  in  the  case  of  the  Pearson  Type  III  distribution,  the  question 
of  using  the  Pitman-type  estimators,  when  the  shape  and  scale  parameters 
(the  iatter  naving  been  set  equal  to  unity  in  the  above)  are  unknown,  re¬ 
mains  op  n.  We  plan  to  investigate  this  problem,  as  well,  in  future 


^♦8 


research • 


f 

\ 


i 

I 


I 


-9 


Indicated  that  tabulated  value  is  to  be  multiplied  by  10 


(Cont inued) 


3016  -.3913  -.5305  -.5722  -.5962 


5.  ADDITIONAL  RESULTS  ON  MINIMUM  VARIANCE  BOUNDS 


This  chapter  is  concerned  witl.  a  few  additional  results  on  special 
problems  in  non-regular  estimation.  Included  are  the  problems  of  con¬ 
structing  lower  bounds  on  the  variance  of  estimators  in  the  case  of 
densities  whose  domain  is  finite  and  depends  on  the  unknown 
parameter  and  for  estimators  of  the  parameters  of  mixtures  of  uniform 
distributions.  Since  relatively  little  effort  has  been  expended  on 
these  problems,  the  results  are  incomplete.  In  particular,  many  details 
of  the  estimation  problem  have  not  been  considered.  One  of  the  more 
interesting  aspects  of  both  problems,  however,  is  the  one  to  be  con¬ 
sidered,  namely,  construction  of  the  bounds. 

5. 1  Construction  of  a  Bound  for  Densities  with  Finite  Domain 

None  of  the  bounds  discussed  in  reference  [  2 ]  is  applicable  to 
estimators  of  a  parameter,  say  9,  for  densities  with  finite  domain  de¬ 
pending  on  9.  An  example  of  such  a  density  is  the  uniform  distribution 
cn  (9,9fl) ,  viz. , 

(5.1)  r(x,«;  “l  9  5  x  2:  9+1 

=  0  otherwise. 

It  is  not  difficult,  however,  by  use  of  the  same  basic  ideas  involved  in  the 
derivation  of  the  Chapman-Robbins  and  Fraser- Gut tman  bounds,  to  construct 
bounds  for  densities  of  this  type.  Although  a  more  general  result  can  be 
derived,  we  shall  here  consider  only  the  relatively  simple  density  of 
equation  (5.1).  The  essential  additional  idea  needed  to  develop  a  bound 
for  this  example  is  that  of  approximating  the  density  from  inside.  This 
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will  avoid  the  problem  of  a  zero  denominator  in  integrals  such  as  those 


in  the  Chapma.i-Robbins  bound. 

We  proceed  as  follows.  Let 

(5.2)  fh(x,6)  -  WS  xS  W-h 

«=  0  otherwise, 

where  0  <  h  <  -j.  Suppose  that  for  the  family  of  densities 

{  f^(x,9)  ]  0<h<| , -°°<9<co}  ,  the  statistic  t  =  t(X^ . Xn) ,  where  X^ . X^ 

is  a  sample  of  size  n  from  f^(x>9),  is  an  unbiased  estimator  of  £fh, 
identically  in  h,9.  Then  by  an  argument  exactly  as  before  [2  ], 


(5'3)  V(°  '  rS1  ^ - 7z - ^1 -  ’ 

2  J  ...J  lnfh(xi,9)  -  Pf0(xrB)r  nf0  l(Xi,9>ndXi 


0 


9 


Thus 


CKh<±  (l-2h)  -1 


The  maximizing  value  of  h  is  the  solutira  of 

(5.5)  0  -  (l-2h)"n  -  1  -  nhU^h)'"'1 
i.e. ,  of 

#yi.l 

(5.6)  0  -  (l-2h)  +  nh  -  1  +  2h  . 

32 


For  large  n,  the  term  (l-2h)  tent's  to  zero  since  0  <  h  <  so  that 

an  approximate  solution  of  equation  (5.6)  is  h  «  — -r*  Substitution  of 

n+2 

this  approximate  solution  into  inequality  (5.4)  yields 


(5.7) 


V(t)  £ 


(1 


-  1 


2  , 


_  --157  . 

n2 

We  shall  see  that,  although  this  is  not  an  optimal  result,  it  is  of 
the  correct  order  of  magnitude  in  n.  (It  is  therefore  not  unreasonable 
to  assume  that  the  usual  type  of  generalization  to  bounds  based  on  higher 
order  differences  will  give  the  optimal  result.)  Although  it  is  easy  to 
construct  examples  in  which  this  is  not  the  case,  for  the  specific 
example  chosen  an  optimal  estimator  can  be  deduced  from  simpler  con¬ 
st  j*’- it  ions.  In  fact,  the  statistic  (X.-.jX.  ■  >,  where  X.,.  is  the  ith 

(I)  (n)  (i)  - 

order  statistic,  is  a  minimal  sufficient  statistic  for  G.  To  determine 
the  best  linear  combination  of  X^  and  X^ ,  where  "best"  is  equivalent 
to  minimum  variance,  unbiased,  we  need  the  joint  distribution  of  X^» 
x(n).  namely, 

(5.8)  dF  „  (x  ,x  )  -  n(n-l)[F(xn)  -  F(x.)]n'2dF(x.)dF(xn) 
X(i),X(n)  In  n  1  In 

"  n(n-l)(x  -x  )D  ^dx.dx 
n  i  in 

with  0  <  <  xn  <  Gfi.  rhus  the  marginal  distributions  are 
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(5.9) 


9-t-l 


f  (x^  -  nO-Xj+0) 


9  <  x, 


otherwise, 


(5.10)  £„  <V‘n(V9> 


9  <  x  <  9+1 
n 


otherwise. 


We  find 


(5.11) 


EX(L)  "  I  ft  nx(l-x-0)n  dx 


*  9  + 


(5.12)  ®^(n)  "  8  +  n+T 


By  symmetry  we 


conclude  that  V(X^)  “  ^X(n)^ 


Thus  the  best  linear 


enmbinatlon  of  X.n  and  X,„,  is  evidently 

v,i;  n-*/ 


[5. 13) 


_  X(l)  *  X(n)  '  1 

-  - 


compute  V(&),  we  need  V(X(1))  and  Cov(X(1),X(n)).  Since 


2  f>®+i  nfl 

(5.14)  E(l-X(1)  +  «)  -J9n(1'3rt-8>  dx 


n+2  ’ 
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(5.15) 


n 


(1)  "7^2  "  (1+#>  '  4  2(1+«)BX. 


n+2  '  (1+0>2  + 


e2+^e  + _ L_ 

**1  (n+l)  (ih-2) 


Similarly,  since 


(5.16) 


€►+1  *+l 


E(X(n)‘X(l))  "J  J  n(n-l)(x  -x  )"dx  dx 

9  x  “  A  x*  l 


we  find 


— sfa-1)., 

(tt+l)(n+2) 


(5.17) 


EX(l)X(n)  "  ®2  +  9  +  ^2  * 


It  follows  from  equations  (5.11)  and  (5.12)  that 


(5.18) 


V<X(1)) 


e2  +  ~  e  + - 1 _ 

0+1  (n+l) (n+2) 


- ^i>2 


(»+l)  (n+2) 


V(X(n)>’ 


and  from  equations  (5.11),  (5.12)  and  (5.17)  that 


(5-19) 


Cov(\ir’W  '~2 — 

(n+l)  (n+2) 


(5.20) 


V<3>*  '<*(.)>  +  +  2Cov<X(1)>X(d))] 

1 

K  — ■  ■ 

O  /  _  .  1  \  i  .  A  V  * 


Hence  the  optimal  bound  is  evidently,  for  large  n,  approximately  l/2n^. 


5.2  Estimation  for  a  Mixture  of  Two  Uniform  Distributions 

Suppose  XL . Xn  are  a  sample  of  size  n  from  a  mixture  of  two  uni¬ 

farm  distributions,  defined  on  (0,0^  and  (O.ep,  respectively.  Recall 
that  the  uniform  distribution  itself  presents  a  non-regular  estimation 


problem.  This  is  also  true  of  a  mixture  of  uniforms.  In  fact,  such 
mixtures  are  examples  of  distributions  for  which  the  regularity  con¬ 
ditions  fall  to  hold  for  several  parameters.  Although  we  are  admittedly 
a  long  way  from  solution  of  the  general  problem  of  non-regular  esti¬ 
mation,  if  is  interesting  to  investigate  the  additional  complexity 
introduced  because  of  the  mixture  structure.  The  mixture  to  be  con¬ 
sidered  is  one  of  the  simplest  such  distributions.  Furthermore,  as  we 
shall  see,  the  estimation  problem  has  been  partially  solved  for  this 
example. 


The  density  function  of  a  mixture  of  two  uniform  distributions  is 


(5.21) 


«1  *2 
f(x)  =  a"  +  9" 
1  2 


0  *  x  *  0, 


-  0 


otherwise. 


where  0  <  0^  <  0j  <  or  ,cr  >  ®  flnd  01 1  +  a2  “  We  shall  assume  that 
the  mixing  probabilities  are  known.  Although  we  shall  not  give  a 

detailed  analysis  of  the  estimation  problem  as  far  as  0^  is  concerned, 
it  is  easy  to  see  that  the  maximum  observation,  *s  a  consistent, 
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highly  accurate  estimator  of  0^ ,  having  vari«..ce  of  order  n 


-  L 


In  fact 


X.  .  is  the  maximum  likelihood  estimator  of  0.  We  proceed  with  the 
(n)  2 

problem  of  estimating  9^. 

Note  firstly  that  the  likelihood  function  can  be  written  as 


(5.22) 


a  a2  R  a  n'R 

L " ^  h  +  T2 :  'V  • 


where  R  =  number  of  X^  =  0^.  Thus 

a  Qr0  or 

(5.23)  logL  =  R  log  “  -r  “  y  +  (n-R)  log  ^  y 


a , 


V2 


anlo^y'  +  R  log  V1  +  J' 


The  maximum  likelihood  estimator  of  0^  is  therefore  that  value  0^  “  X^ 


such  that 


(5-24) 


,  108  d  +  db^y. 

Jal . .  2  ( J)  2  (N) 


where  either  0„  is  also  known  or  X.  Is  substituted  for  0  .  Note  that 
2  (n)  2 

r  ★ 

if  we  write  Yj  *  j  log  ll  +  (q  ^0-,/q  ?X  ^ ) J ,  then  9^  is  a  function  only  of 

Y^,  an  extremal  order  statistic.  This  suggests,  from  past  experience, 

*  - 1 

that  the  variance  of  9^  is  of  smaller  order  than  n  ,  Before  proceeding 
with  this  analysis,  we  consider  the  problem  of  constructing  a  lower 
bound  on  the  variance  of  estimators  of  9^. 

We  note  that  the  Chapman- Robbins  bound  is  applicable  and  that  in 
similar  problems  it  has  resulted  in  bounds  of  the  correct  order  of  mag¬ 
nitude.  There  ar»  several  ways  of  formulating  .he  Chapman- Robb l ns  bound 
in  this  problem.  The  simplest  procedure  is  evidently  to  derive  the 


bound  in  terms  '-f  £(x,9j)  and  f(x,9^-h),  with  0  ■w  h  t  0^.  We  find 
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0-25) 


V(t)  '  SUP  - 8 

r  2  r  2  „  r  r 2 


iIo2---J02 11  ^f2(xr0rh)f  l(yl,0l)dxl-'  ■  1 


The  Integral  in  the  denominator  o£  inequality  0.25)  is 

0„ 


(5.26)  £  (x,9  -h)f  (x,0  )dx 

0  1 


;rVh  °i  °2  2  «,  ;l.  .  r*i  !2  2,1i 

U0  '  »j-h  «2  ^  v  92  J  «  -h  ''V  '■*!  V 


9_  cr  n 

r.  2  2  ! 

+j8,  ^ 


or,  <»„  2 


“o  2  o„  or,  o- 


or,  cr. 


tj\  6^5  +  r2  J  <Vh>  +  KTf'  h  ♦  ^  +  r/VV  K  ^  +  9 


Thu* 


I r  ®,  a2  <*,  -  »,  %  -1  n 


(5.27)  V(t>  *  sup  sr - - - g - 5—5- 

0<h<8  n  1  1  2  +  _2 

Lc  V  0  -h  +  0^  S  Q~  J 


~  1 


whei  e 


(5.28) 


a ,  a 
1  A  2 
c  "  F  +  T' 
®1  ®2 


Since  the  exact  determinat  Ion  of  the  bound  leads  to  certain  algebraic 
difficulties,  we  shall  consider  only  its  asymptotic  form. 


1 

dx 


-L  n 

>  J 
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To  compute  the  suprenum  for  large  n,  we  proceed  a a  follows.  Con¬ 
sideration  of  the  form  of  Inequality  (5.27)  suggests  that  the  supremum 
will  occur  at  a  point  h  which  is  (T(l/n).  Let  us  therefore  make  the 
transformation 


(5.29)  h  “  . 

The  denominator  of  the  right-hand  side  of  Inequality  (5.27)  then  becomes 


'  *1  |  “2  N 

)  +  ^2-  T 

L  ~  1 

1-z/n)  0X  > 

;  +  c©2  J  - 

“f2  ", 

9i  15 

!i  +!i 

91  92 


_1+Ax  -  •/»> 


“V  1-z/n  )  _1» 

say.  Note  that  a  +  b  ■  1.  It  follows  that 

('  .  b  _  n  lo*0  +  T-*7n  ^ 


(5.31)  lim  (a  +  /  -  lim 

n  — >  ®  n  — 


exP  /  losCa+lS75^ 

v  n  — »  ®  - - - - 

I  1/n 


.  1-a 

lim  1-z/n 


feM 
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ez(l-a) 

zb 


where  L'Hospital's  Rule,  with  n  considered  a  continuous  variable,  has 
been  employed  in  the  third  step  of  the  derivation.  It  follows  that  for 
large  n  the  bound  becomes  approximately 


(5.32)  V(t)  ?  sup 


2„  2.  2 

z  /n 


CKz<n  ^Pl^l  'V/*1 


sup 


4  2  2  2 

z  / c 


2  4  *  f  0 

n  (Kz<n  exp^a*^  /Q^cj-l 


6 


4  2 
1  c 
"2  4 


sup 

2 


u 

u 


n£*l  (Ku<ai‘n/61c  e~~ 1 


The  supremum  required  is  precisely  that  encountered  in  the  corresponding 
analysis  in  the  case  of  the  exponential  and  uniform  distributions. 

(Cf.  reference  [2  ].)  From  these  we  conclude  that  the  supremum  occurs 

2 

at  approximately  u  *  1.5936  (an  acceptable  solution  if  n  >  1.59369^c/a^ 
hence  for  sufficiently  large  n) .  By  analogy  with  the  previous  such 
analyses,  we  conclude  that 


(5.33)  V(t)  * 


4  2 
.648©1  c 

2  4 

n°i 


•648V  4  +  “291  >\ 


2  2 
n  a 


v2y 
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Although  this  is  not  an  optimal  result,  the  bound  is  apparently  of  the 
correct  order  of  magnitude.  From  past  experience  it  is  conjectured 
that  the  Fraser-Guttman  will  provide,  in  the  limit,  the  optimal  bound. 

The  solution  of  the  estimation  problem  itself  is  given  by  Chernoff 
and  Rubin  [5  ]  in  a  paper  in  which  this  solution  enables  the  authors 
to  attack  a  more  general  problem  of  estimating  the  location  of  a  dis¬ 
continuity  in  density.  Chernoff  and  Rubin  derive  the  maximum  likeli¬ 
hood  estimator  give  in  equation  (5.24)  and  investigate  its  asymptotic 
properties.  By  a  rather  complex  analysis,  they  deduce  the  limiting 

'*  *  -  2  v 

distribution  of  n(0  -S^)  and  thereby  verify  that,  in  fact,  V(Q^)  “  tf(n  )■ 

We  note  that  this  does  not  complete  the  solution  of  the  mixture 
problem.  An  estimator  of  the  mixing  measure  has  yet  to  be  constructed. 
Although  this  should  not  be  difficult,  the  problem  of  determining  the 
joint  asymptotic  distribution  of  the  estimators  could  provide  some 
difficulty.  This  problem  may  be  investigated  further  in  future  research. 

In  addition,  other  mixtures  of  non-regular  distributions  may  also  be 
considered.  The  motivation  for  this  line  of  investigation  is  as  follows. 
We  have  been  concerned  with  distributions,  such  as  the  Pearson  Type  III 
and  Waibull,  which  have  many  applications  in  areas  such  as  life-testing. 
The  specific  problem  under  investigation  is  that  of  estimating  a  location 
parameter  presumably  different  from  the  origin.  In  the  life-testing 
applications  this  parameter  would  therefore  necessarily  be  positive.  It 
follows  that,  the  distribut rons  considered  give  zero  probability  to  some 
non- degenerate  interval  to  the  right  of  the  origin.  This  does  not  appear 
to  be  a  very  realistic  model.  In  most  life-testing  applications  unusually 
early  failures  occasionally  occur.  It  such  an  unusual  observation  is 
obtained,  then  a  very  misleading  picture  of  the  distribution  can  result 
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by  using  the  estimator  under  consideration  in  spite  of  the  fact  that  the 
given  life  distribution  may  fit  very  well  to  the  remaining  data.  The 
problem  is  further  aggravated  by  the  fact  that  the  estimators  are 
chosen  on  the  basis  of  their  asymptotic  properties,  whereas  it  is  pre¬ 
cisely  in  large  samples  that  such  unusual  observations  can  be  expected 
to  occur.  The  fact  that  early  failures  may  be  "unusual"  in  some  appli¬ 
cations,  i.e.,  may,  in  fact,  be  outliers,  suggests  that  a  more  appropriate 
model  may  be  a  mixture  of  distributions  with  one  component  of  the  mix¬ 
ture  located  at  the  origin  (possibly  with  small  mixing  probability)  and 
one  component  with  positive  location  parameter. 
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APPENDIX 


COMPUTER  PROGRAM  FOR  CALCULATION  OF  BOUND 

Description  of  Program 

MAIN 

The  number  IKONT  is  read  in.  If  non-zero,  it  indicates  that  tables 
of  values  for  the  function  follow,  and  the  routine  TAPEIN  is 

tailed  to  read  these  in.  These  quantities  are  now  read  in:  (All  symbols 
refer  to  inequality  (2.14)) 

K:  Starting  value  for  h 

XN:  Sample  size  n  (in  floating  point  form) 

P:  Starting  value  for  p 
ALPHA:  a 

IPRST:  If  non-zero,  certain  additional  output  is  printed  for  debug 
purposes . 

If  H  *  100.0,  this  signifies  the  end  of  calculations  for  the  current 

ALPHA  value,  we  call  OUTIT,  and  proceed  to  the  next  ALPHA  value,  if  any. 

These  quantities  are  next  read  in: 

S:  Step  size  for  search  procedure  (initial  value) 

FINCR:  Increment  for  use  in  tables  of  g-function 

CST:  Cut-off  value  for  use  when  variation  in  the  variance  bound 
is  small 

FNQl:  Number  of  iterations  to  be  performed  using  interpolation  method 

FNQ2:  Number  of  iterations  to  be  performed  using  exact  method  +  FNQl 
(i.e.  total  number  of  iterations) 

GFAC:  Cut-off  value  for  use,  in  comparison  with  current  gradient  of 
search  path,  to  decide  whether  to  reduce  step  size 

GFLAG2>  When  non-zero,  P  is  constrained  to  be  unity. 
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The  initial  value  of  the  variance  bound  V  is  obtained,  for  the 
initially  specified  p,h  values,  by  callirg  VAR(V).  The  specified 
number  of  interpolation  iterations,  followed  by  the  specified  number 
of  exact  iterations, is  performed  and  the  resulting  variance  bound  is 
printed  out.  The  procedure  is  truncated  if  the  variation  in  the  last 
experimental  design  for  V  becomes  sufficiently  small.  The  terminology 
"experimental  design"  is  used  in  conformity  with  most  papers  on  optimum 
seeking  procedures.  Each  iterac^on  is  performed  by  calling  FUDGIT.  In 
the  case  where  P  is  constrained  to  be  unity,  PFUDG  is  used  instead. 

FUDGIT 

An  experimental  design,  consisting  of  the  four  corners  of  a  square 
is  set  up.  If  the  design  overlaps  the  experimental  boundaries,  the  de¬ 
sign  is  reduced  in  size.  (The  side  of  the  square  is  always  maintained 
at  0.6  times  the  step-size.  Hence  reduction  of  the  size  in  design 
always  implies  reduction  in  step-size,  and  vice  versa.)  The  routine 
VAR(V)  is  called  four  tiroes  to  obtain  V  at  these  four  experimental 
points  and,  if  none  of  the  calculated  values  at  the  four  corners  ex¬ 
ceeds  that  at  the  center,  the  design  is  also  reduced  in  size.  Finally, 
a  is  taken  in  the  direction  of  steepest  ascent.  If  this  results 

in  improvement,  similar  steps  are  taken,  until  no  improvement  is  ob¬ 
served.  If  the  current  point  is  within  step-size  of  any  of  the  last 
six,  the  quantity  NRFLAG  is  set  equal  to  1  and  step-size  reduced  by  the 
factor  0.6.  Otherwise  ISTYMI  is  set  equal  to  1,  and  the  question  of 
whether  to  reduce  step  size  is  dealt  with  in  the  next  iteration,  once 
the  current  gradient  of  the  path  is  known. 
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VAR(  ') 

This  calculates  the  variance  bound  V  u?:ng  equation  (2.14),  and 
obtaining  the  values  for  g( from  GETALL(GVAL1,  GVAL2). 
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where  denotes  the  ordered  pair  (p,h)  with  suitable  origin.  For 

convenience  we  shall  change  variable  nomenclature  to  (x,y).  The  vector 

of  the  g  values  is  denoted  by  (Z  , i»l. . . 6) .  Then  we  have,  for  the 

2  2 

quadratic  surface  A^x  +  k^y  4-  AjXy  +  A^x  +  A$y  +  Afe  »  0, 


Thus 

6 

A(I)  -  E  E (J)G( I , J) . 

J-l 

The  G(I , J)  entries  are  read  in  by  means  of  the  routine  STRATE.  The 
A(I),  I  ■  l,...,  6,  are  calculated  aud  used  to  calculate  the  g-values  at 
(Pq»^  )•  Since  two  forms  of  the  g  function  are  involved  in  inequality 
(2.14),  the  whole  process  is  carried  out  for  each  form,  in  parallel. 
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SEARCH 


The  routine  attempts  to  locate  a  table  entry  for  a  specified  (p,h) 
ordered  pair,  and,  if  it  does  not  find  one,  calculates  the  value  and 
stores  it  for  future  reference.  The  Initial  search  Is  made  in  the  section 
of  the  table  tnc*'  was  read  In,  corresponding  to  the  p-value.  The 
starting  location  of  this  section  is  stored  in  IP(1,IPVAL),  where  IPVAL 
is  p  expressed  in  units  of  FINCR.  If  not  found  there,  we  search  in  the 
blocks  (each  10  cells  long)  whose  starting  locations  are  given  by 
IP(2, IPVAL),  IP(3, IPVAL),...,  IP(10, IPVAL).  If  the  required  h-value  is 
found  in  this  search,  we  take  the  corresponding  values  for  G(.,.,.), 

GVAL1,  and  GVAL2,  and  return. 

If  not,  we  calculate  them,  using  the  routines  VAUJT  and  VALU2,  and 
store  them  (using  the  routine  STORE)  in  the  next  available  location  in 
the  storage  block  —  a.g.,  the  block  whose  starting  location  is  given  by 
IP(2, IPVAL)  --  currently  being  used  (or,  if  exhausted,  assign  a  new  block). 

In  the  storage  area  TAB(. ,.),  the  h  values  are  stored  in  TAB(1, .), 
the  first  form  of  g  in  TAB(2,.),  and  the  second  form  in  TAB(3 , . )  , 

G(B,AC,C) 

This  is  a  numerical  integration  routine  that  calculates  the  function 
defined  in  equatiou  (2.15).  It  takes  special  account  of  the  case  or  <  1 
where  the  ordinate  tends  to  infinity  as  (x-a)  tends  to  zero.  Since 
standard  numerical  integration  methods  are  used,  the  routine  will  not  be 
described  in  detail. 

PFUDG 

This  is  a  maximum  seeking  routine  used  when  p  is  constrained  equal  to 
1  —  i.e.,  the  search  is  performed  in  one  dimension,  with  h  the  variable. 
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An  auxiliary  routine  LARMAX  is  employed.  This  has  been  described  in 
the  main  body  of  this  report.  LARMAX  uses  the  same  routines  described 
above  to  obtain  calculated  values  for  theg(-»-.-)  functions--  no 
interpolation  is  employed. 

REMAINING  ROUTINES 

The  remaining  routines  are  sufficiently  well  described  by  their 
flow-charts  and  listings.  The  flow  charts  are  given  in  Figures  A-l 
through  A- 6,  below.  A  sample  input  sheet  is  given  in  Figure  A- 7.  The 
listing  follows  the  figures. 
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Do,  for  J  -  1  to  150, 
i  IPITH(J)  -  IP(1,J+1) 

!  -  ip(i,j) 


This  ALPHA  value 
teriain/  ted.  Call 
OUT  IT 


Read  In  S,  FINCR,  CST,  FNQ1 ,  FNQ2 ,  GFAC ,  GFLAG2 

Set  GFLAG  »  0 


Set  GFLAG  •  1.0 


Call  FUDGIT 


Calculate  PLIM 
Call  VAR(V) 

Set  NOG,  ISTYMI 


Call  PFUDG 


NH  -  NO 


Use  exact  calculation  lor 
g  lunct ion. 

Set  GFLAG  -  1.0,  VMAX  -  0 


call  wen 


Figure  A-2:  Flow-chart  for  FUDGIT  Routine 


VAR(V) 


Figure  A-3;  Flow-chart  for  VAR  routine 


Start  > 

- - — ■  ■  1  ~ _ 

|  Call  GETALL  (  GVAL1,  GVAL2  ) 


Calculate  V 


|  RETURN 


Figure  A-4:  Flow-chart  for  GETALL  routine 


GETALL  (  GVAL1 . 


GVAL2  ) 


Start  I 


[Call  KXGETL 


c  u 


t 


/U  - 


(Call 


i'UEIAL, 


Call  STRATE?G5  . — . . 

(Gets  matrix  values  for  interpolation) 

Call  KUSS 

[_(Gets  the  six  points  for  use  in  interpolation) 


^ - 

> - 


Do  9000  I  =  1,  6  ”• - 

Call  SEARCH 

(Gets,  from  the  tables  or  by  calculation,  the  g(.,.,.) 
values  for  the  six  points) 

Call  PERM  (orients  the  points  for  application  of  the  matrix) 
Calculate  GVAL1,  GVAL2  by  interpolation 

~7=TT~' '  »  . . . - — - - - 

Write  GVAL1,  GVAL2 

RETURN  v  - - 
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Figure  A- 5;  Flow-chart  for  TAPEIN  routine 


TAPE IN 


|  JJ  -  3 
i  I HOLD  »  3 


READ  IN  IPVAL 
(p  expressed  In  Integer  form, 
with  unit  *  FINCR) 


ipvaC 


RETURN 


IPVAI> - (  >0  ) 

<IPVM>> - —U  150j 

IP(  1,  IPVAL  )  =  JJ 

(Starting  point  for 
table  for  new  p  value) 


Read  in  table  entries:  ITEMP,  TAB2J ,  TAB3J 

ITEMP2,  TAB2K,  TAB3K 

ITEMP3,  TAB2L,  TAB3L 


Store  the  above 
table  entries 


JJ  +  3 


This  p 

value  exhausted 

IHOLDN  = 

JJ 

IPLTH  ■= 

IHOLDN  -  IHOLDO 

m 

length  of  record 

IH0LD0 . * 

IHOLDN 
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If  H  ”  100.0,  go  to  Start  (new  Alpha  value).  Keypunch  decimal  point  in  any  column  of  the  field. 

— 0  A:  In  Column  1»  ”4 "  table  values  follow,  "2"  means  Cards  B,C,  follow.  After  table  values,  read  Card 

again. 

Fig.  A- 7:  Standard  Input  Form 
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LIST  OF  SHARE  LIBRARY  ROUTINES  USED 
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13  ABSTRACT 


The  project  is  a  continuation  of  research  on  problems  in  non-regulir 
estimation  reported  in  ARL  Technical  Documentary  Report  No.  ARL  65-177(1965)* 
Included  in  that  report  was  a  lower  bound  on  the  variance  of  unbiased  estimatcrr 
of  the  location  parameter  of  the  Pearson  Type  III  distribution,  applicable  in 
the  non-regular  case.  This  report  includes  the  results  of  a  numerical  investi¬ 
gation  of  that  bound  for  varying  values  of  the  shape  parameter  of  the  Type  III 
distribution  and  varying  sample  sizes.  The  bound  is  apparently  of  the  correct 
order  of  magnitude  in  a  certain  region  of  the  parameter  space  but  sub-optimal 
elsewhere.  Approximations  to  the  Pitman  estimators  for  location  parameters  are 
investigated  for  both  the  Pearson  Type  III  and  Weibull  distributions.  In  both 
cases,  the  minimum  observation  apparently  contains  the  major  part  of  the  in¬ 
formation  concerning  the  unknown  location  parameter.  Some  results  on  the  non¬ 
regular  estimation  problem,  particularly  concerning  the  derivation  of  variance 
bounds,  in  the  cases  of  densities  with  bounded  domain  depending  on  an  unknown 
parameter  and  of  mixtures  of  uniform  distributions,  are  alsp  discussed. 
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